DOCUMENT RESUME

ED 211 593                                              TM 820 027

AUTHOR          Choppin, Bruce H.; And Others
TITLE           Test Use Project. Annual Report.
INSTITUTION     California Univ., Los Angeles. Center for the Study
                of Evaluation.
SPONS AGENCY    National Inst. of Education (ED), Washington, D.C.
PUB DATE        Nov 81
GRANT           NIE-G-80-0112
NOTE            142p.
EDRS PRICE      MF01/PC06 Plus Postage.
DESCRIPTORS     *Elementary Secondary Education; Grade 4; Grade 6;
                Grade 10; Language Arts; Mathematics Achievement;
                *National Surveys; Reading Achievement; *Teacher
                Attitudes; Testing Problems; *Test Use

ABSTRACT

One of the two major phases of the Test Use Project
of the Center for the Study of Evaluation (CSE) is discussed; that
is, the collection and analyses of survey data from a national sample
of teachers and principals representing the targeted grades/schools.
Some historical background influencing that phase of the project is
provided. The findings emanating from CSE's 1978 small-scale study of
testing, which in some ways was the primary inception for the
study, are described. The findings of fieldwork preceding the
national survey are discussed and principal findings of the survey
are presented. The survey included questions about the teacher's
professional background, classroom characteristics, use of resources,
district assistance, district training and college courses, district
uses of assessment information, district reporting of test results,
teacher attitude toward tests and test-related issues, and teacher
uses of assessment results. (Author/GK)






***********************************************************************
*   Reproductions supplied by EDRS are the best that can be made      *
*                   from the original document.                       *
***********************************************************************



Center for the Study of Evaluation

UCLA Graduate School of Education
Los Angeles, California




U.S. DEPARTMENT OF EDUCATION
NATIONAL INSTITUTE OF EDUCATION
EDUCATIONAL RESOURCES INFORMATION
CENTER (ERIC)

This document has been reproduced as
received from the person or organization
originating it.

Minor changes have been made to improve
reproduction quality.

Points of view or opinions stated in this document
do not necessarily represent official NIE
position or policy.







"PERMISSION TO REPRODUCE THIS 
MATERIAL HAS BEEN GRANTED BY

TO THE EDUCATIONAL RESOURCES
INFORMATION CENTER (ERIC)."



DELIVERABLE - November 1981

TEST USE PROJECT
Annual Report

Bruce H. Choppin, Project Co-Director
Donald W. Dorr-Bremme, Project Co-Director
James Burry, Senior Research Associate






Grant Number 
NIE-G-80-0112 
P-2. 



CENTER FOR THE STUDY OF EVALUATION
Graduate School of Education
University of California, Los Angeles



The project presented or reported herein was
performed pursuant to a grant from the National
Institute of Education, Department of Education.
However, the opinions expressed herein do not
necessarily reflect the position or policy of
the National Institute of Education, and no
official endorsement by the National Institute
of Education should be inferred.






Table of Contents

Introduction

Project History
    Planning Activities
    The Research Model Guiding the Test Use Project

Findings from the 1978 Study
    The Volume of Testing Occurring in Schools
    The Extent to Which Teachers Use Test Results
    Teachers' Knowledge of and Attitude Toward Tests
    Factors Influencing the Use of Test Results
    Summary

The Exploratory Fieldwork
    Intentions
    General Findings
        Range of Tests Administered
        Range of Reported Uses
        Patterns of Assessment Results Use
        Relationships Between Types of Tests and Categories of Use
    District Narratives
        District One
        District Two
        District Three
    Test Uses/Issues in District One
        District Summary
    Test Uses/Issues in District Two
        District Summary
    Test Uses/Issues in District Three
        District Summary

The National Survey
    Sampling Methodology
        The Initial Plan
        Revised Sampling Design
            First Stage (Selection of Districts)
                Stratification
                Sampling Frame
                Selection Procedure
            Second Stage (Selection of Schools)
            Third Stage (Selection of Teachers)
    The Elementary School Teacher Sample
        Teachers' Professional Background
        Classroom Characteristics
        Use of Resources
        District Assistance
        District Training/College Courses
        District Uses of Assessment Information
        District Reporting of Test Results
        Teacher Attitude Toward Tests and Test-Related Issues
        Teacher Uses of Assessment Results
    The Secondary School Teacher Sample
        Teachers' Professional Background
        Classroom Characteristics
        Use of Resources
        District Assistance
        District Training/College Courses
        District Uses of Assessment Information
        District Reporting of Test Results
        Teacher Attitude Toward Testing and Test-Related Issues
        Teacher Uses of Assessment Results



INTRODUCTION 

CSE's Test Use Project has been gathering information bearing on
a range of testing issues for students, teachers, administrators,
researchers, and policy makers. It is clear that our schools do a
great deal of student achievement testing, and some limited information
has already been collected on certain practices affecting our students
in some areas of the country. Until the CSE study, however, we have
lacked information that is nationally representative and illustrative of
the entire range of tests being administered, and yet which is sufficiently
focused to be of use in test-based policy matters.

CSE has been concerned, first, that we have been lacking
descriptive data reflecting the entire testing picture: the range of
tests being administered, their associated users and consumers, and the
range of students affected by particular kinds of tests. Second, we
have also lacked the more inferential utilization data: the primary
and secondary users of test information, the intended and actual uses
of test information, variations in use across users and organizational
settings, the kinds of decisions made on the basis of test information,
the kinds of students thereby affected, and the attendant costs of the
testing enterprise.

Since the inception of the Test Use Project in December 1979, we
have been examining these kinds of issues in a broad framework which
defines testing to include formal tests, both norm- and criterion-
referenced; curriculum-embedded measures; district-, school-, and
teacher-developed tests; as well as the more informal measures such as
teacher quizzes, observations, and other interactions with students.
In short, our study has not aimed at any single kind of test, user, or
student. But the study is also sharply focused in this broad framework,
and examines some of the more troublesome aspects of testing: student
achievement testing in language arts and mathematics; at selected grade
levels where testing may critically affect large numbers of students and
their teachers (fourth and sixth grades in elementary schools and tenth
grade in high schools); with emphasis on first- and second-orders of test
use (Baker, 1978). Finally, information on these matters has been primarily
reported to us by teachers and principals, those who are closely involved
in first- and second-order uses of tests.

The Test Use Project has been proceeding in two overlapping
phases. Phase I, taking place between December 1979 and November 1981,
has culminated in the collection and analyses of survey data from a
national sample of teachers and principals representing the targeted
grades/schools. During Phase II of the study, which began in
February 1981 and will conclude in November 1982, the project will be
conducting on-site studies in a small number of schools. The primary
intention of this phase of the study is to identify the direct and
indirect costs of testing, with the secondary intention of pursuing
salient findings of the Phase I work and expanding the contextual
base critical to interpretation of its survey data.

In our work thus far we have developed and refined the conceptual
scheme informing our work; reviewed and reported on the relevant
literature; conducted preliminary fieldwork in schools to pilot-test
questions about testing with teachers and principals; drawn a nationally
representative sample of teachers and principals from the target grade
levels; pilot-tested and sent out questionnaires to the sample;
analyzed the data resulting from that sample; and planned the conceptual
framework for our Phase II activities.

When all analyses of the Phase I survey data have been completed,
we plan to begin dissemination of results to teachers, principals, and
other administrators, planners and policy makers, researchers, and
testing specialists. Dissemination will continue through the Phase II
cost study which, when completed, will relate testing practices to a
range of monetary, opportunity, and psychological costs. Our findings
should have a bearing not only on testing practices and test-based
decisions about individuals and groups of students, but also on test-
related policymaking and school practices including test selection,
development, and use, as well as teacher inservice in these areas.

Since this report discusses one of the Test Use Project's two
major phases, we will provide some of the historical background that
influenced that phase of our work and led up to this document. We will also
describe some of the findings emanating from CSE's 1978 small-scale
study of testing, which in many ways was the primary inception for the
present study; continue the findings in a discussion of the fieldwork
which preceded the national survey; and present the principal findings
of the national survey.






PROJECT HISTORY 



Planning Activities

There is little doubt that testing in our schools has been
increasing in response to federal and state program assessment
requirements, accountability concerns, national and regional assessment
needs, state mandated minimum competency requirements, and the expansion
of curriculum-embedded testing programs.

As with other highly visible activities, testing has become the
subject of much controversy, and the legal and political systems have
entered the debate. Testing proponents have argued that tests contribute
to educational quality control, help in providing individualized
instruction to students, and assist in improved educational decision
making. Critics of testing, on the other hand, have described the
arbitrary nature of current testing practice, have challenged tests for
their biased properties, and questioned their appropriateness to
contemporary education and its changing functions.

While there is some empirical information available about testing
(six full standardized test batteries, on average, are taken by a
student during his or her school years; at least 90 percent of the
LEAs in the country administer standardized tests to their students;
over 40 states conduct a state assessment program and/or have adopted
minimum competency legislation), we have been lacking nationally
representative information about the nature of this vast amount of
testing and how it is or is not being used in schools. CSE's Test Use
Project has been collecting information to answer these questions over
the past two years. That is, we have been attempting to document how
much testing is going on in schools, what kinds of tests are administered
and with what frequency, which of these tests are used or not used in
the decisions affecting schooling, and at what costs. In addition, we
have been examining the coordinate issue of the contextual factors
which influence the administration of tests and the use of tests for
instructional decision making.

The framework we devised to investigate these matters suggested
that in order to understand current testing practices, we needed to have,
for each type of test administered, information concerning its intended
purposes, its characteristics, the context of administration, the
actual use of its results, and the costs. This framework enabled us
not only to describe the nature of testing, but also to explore
relationships among the surrounding components listed above.

Within the framework described above, then, we have been gathering
information on testing practices, test uses, and testing costs over the
two phases of the project. In each of these phases, our research became
progressively more focused, beginning with wide-ranging inquiry to
provide a comprehensive view of relevant phenomena and perspectives,
followed by the design of specific study questions and instruments to
answer them, and finally the collection and analyses of data collected
on the questions of interest.

Our planning activities, which were devoted to refining and
focusing the questions of the study and the framework within which they
were pursued, had several components: we re-examined our previous
test use data collected in 1978; we conducted a literature search and
review to examine research on testing and test use and to identify
a range of salient policy issues; we consulted researchers, test
specialists, school-level practitioners, and administrators on the
policy issues and foci appropriate for the study; we conducted
exploratory fieldwork to assess the relevance of the guiding framework
as a tool to provide us with information on tests being administered, the
kinds of purposes they serve, and the factors influencing their
administration and/or use. These activities helped us to explicate the
full range of tests and other assessment devices being administered and
the kinds of factors that might influence test use.

Together, the information stemming from each of our planning stages
suggested that consideration of three basic questions was necessary to
provide a rational structure for delimiting the emphases of the national
survey:

• What issues and questions about educational testing presently
  confront those who make educational testing policy?

• What information is presently available to inform those
  questions and issues? What kinds of information gaps remain?
  Of that information, which will be most useful?

• Of that useful information, which can be obtained at an
  appropriate level of specificity within the scope of the CSE
  project and its available resources?

These questions, concerned with issues in educational testing, status of
current information on testing and test use, and definition of our
research problem, structured our thinking about directions for the national
survey. For example, the matter of current issues in testing raised
a variety of questions of potential relevance to the survey and to
policy makers in a variety of test-related areas. The matter of the
emergence and proliferation of competency testing is one such question.
With more than forty states operating minimum competency testing
programs, some of which require the tests for promotion and graduation
and others simply for checking students' basic educational progress, it
seemed to us that decision makers at all levels need to know if and
how these programs influence students' educational experiences and
life chances, and if they do, to what extent and how equitably. Policy
makers are also concerned with externally required testing for program
evaluation, with its related concerns of accountability and compliance
and the degree to which it may serve other educational purposes. Another
matter concerns district continuum testing and its quality and effective-
ness in improving local instruction. Teacher-constructed tests and
other assessment techniques comprise another important issue since
teachers seem to spend significant amounts of their time administering
their own tests and quizzes. What are the qualities of these tests
that make them appear attractive and useful to teachers? And can these
qualities be incorporated in other tests and testing procedures?
Finally, the area of current issues also reflects matters of equity,
testing costs/benefits, and potential misuse. Are certain kinds of
students possibly being over-tested at the expense of receiving
necessary instruction? Are students in general being tested too much?
Are particular kinds of tests and testing programs worth the time,
energy, and money invested in them? Which have the greatest benefits
and under which conditions? What patterns and/or combinations of
testing provide the highest payoff in terms of generating valid and
reliable information at minimum costs?

The second of the three questions delimiting the study reflected
the status of our current information on testing and test use. What
information is currently available to inform those concerned with the
kinds of decisions outlined above? Our literature review (see Volume II,
Test Use Report, November 1980) suggested that very little concrete
information presently exists.

Opinion and argumentation dominate the published material on
testing. Experts debate the merits of norm-referenced and criterion-
referenced tests. Proponents and opponents of minimum competency
testing argue their cases. The cultural and linguistic biases of certain
tests are cited. Calls appear for the development and use of alternative
assessment procedures and for more teacher training in testing. These
and similar discussions have helped focus the issues that policy makers
now face, but research to address those issues is in short supply.

Few national studies on testing and test use have been conducted.
Those that have been center on teachers' attitudes toward and use of
norm-referenced, standardized tests (e.g., Ebel, 1967; Goslin, 1967;
Kirkland, 1971; Stetz & Beck, 1979). This emphasis recurs in most of
the extant research on testing in particular states and localities
(cf. Angel, 1968; Boyd, et al., 1975; Hotvedt, 1978; Infantino, 1975;
Rudman, 1978; Salmon-Cox, 1980), but contemporary concerns in the area
of educational testing go well beyond standardized testing. Informa-
tion is required on a wide range of tests and assessment practices.

Work by Yeh (1978), which is discussed later in this report, and
others suggests that these concerns about gathering information on a
wide variety of assessment techniques are valid. Our test use exploratory
fieldwork, also discussed later in this report, further pointed up the
relevance of these issues.

The appropriateness of giving attention to, and raising questions
about, the full range of tests and other assessment procedures is also
indicated in sociological studies of teaching (e.g., Lortie, 1975;
Kitsuse & Cicourel, 1963) and research on teachers' decision making
(cf. Borko, 1978; Leiter, 1974; Mehan, 1974; Shavelson, 1977. See
also Airasian, 1979). Yeh's (1978) work in test use provided us with
a starting point for examining this range. The research of others,
still in progress, will also begin to extend understanding of the
current functions of different kinds of testing (e.g., Rudman, Kelly,
Wanous, Mehrens, Clark, & Porter, 1980; Resnick & Resnick, 1978;
Sproull & Zubrow, 1979; National Evaluation Systems, 1978). At
present, however, little is known about the uses and impacts of teachers'
observation- and interaction-based judgments or teacher-made assignments
and tests. The same is true about the functions and influences of
tests embedded in commercially produced curricula. And the information
on minimum competency testing, testing for state and federal program
evaluation, or the objective-based testing accompanying district-
mandated skills continua is equally limited. Aside from the extant
work on standardized testing, there are only a few, rather narrowly
focused studies on one or another kind of test (e.g., Carducci-Bolchazy,
1978; Grew & Whitney, 1978).

The above overview of issues and available information leads to
the third of the three delimiting questions of the study. That is,
what should a national survey of testing practices and test use attempt
to accomplish? Clearly, what policy makers and stakeholders in educa-
tional testing now need most urgently is basic, broadly based descriptive
and inferential information. They need to know what is going on in
schools nationwide with respect to assessment of student achievement.
More specifically, they need to know what tests and other assessment
practices matter and in what circumstances they matter in American
public schools.

Matter, as used here, is construed in two ways: one quantitative,
the other qualitative. In the former sense, a type of test or
assessment practice matters to the extent that it occurs widely in
American schools. Thus, our national survey of testing was concerned
with identifying the types of assessment instruments and practices
which are administered generally and frequently. In the second,
qualitative sense, a type of test or assessment practice matters to the
extent that it has impact. Thus, our survey of testing had also to
identify those types of tests and practices that significantly influence
the lives of students and the activities of practitioners in schools.

Furthermore, our survey work would have to be attentive to two
kinds of impacts or influences. Tests, of course, can impact on the lives
of students (and their families) when scores from those tests count as
major factors in decisions made about them, e.g., placement and
grading decisions. Test scores can also influence students' lives
and teachers' activities when they are used as criteria in evaluating
and changing curriculum, allocating funds, or identifying teachers'
professional needs for inservice training. But the Test Use Project's
exploratory field research in three school districts, as will be seen
later in this report, has also called attention to the impacts that
tests can have by virtue of their very presence as required or recommended
activities.

In summary, the initial research problem for the CSE national
survey of testing was to document what types of assessment are extant
in American elementary and high schools and to discern where particular
types fall on the following "map":

                              Figure 1

                An Initial Map of National Testing
                           and Test Use

                         Does a type of assessment:
                               OCCUR WIDELY?

                      Most                      Least
              +-------------------------+-------------------------+
              | Cell 1                  | Cell 2                  |
       Most   | Occurs widely           | Does not occur widely   |
  HAVE        | Has great impact        | Has great impact        |
  IMPACT?     +-------------------------+-------------------------+
              | Cell 3                  | Cell 4                  |
       Least  | Occurs widely           | Does not occur widely   |
              | Has little impact       | Has little impact       |
              +-------------------------+-------------------------+

This very basic information is currently lacking, as earlier discussion
has argued.

Discovering how types of tests and other assessment practices array
on the above "map" would indicate (Cells 1 and 3) which types are now
consuming significant amounts of administrators', teachers', and students'
time and energy, and (in rough approximation) public dollars. Research
toward this end was also intended to indicate which types of assessment
instruments and procedures bear most heavily on students' educational
experiences and life chances and upon the professional activities of
practitioners in schools. Simultaneously, then, such a "map" of tests
would enable those concerned with testing to identify the tests that
matter most nationwide (Cell 1, as "mattering" has been defined here) and
those that matter least (Cell 4), and it would offer a rough but useful
initial guide to those types of assessment activities for which costs
may currently exceed benefits (Cell 3). Thus, the survey would attempt
to facilitate sorting and prioritizing the range of issues and questions
that confront those concerned with assessment of student achievement
in its various forms, while providing a basic descriptive picture of
assessment activities.
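
As an illustration only, the sorting logic of the Figure 1 "map" can be
sketched in a few lines of Python. The class name, the field names, and
the two yes/no flags are assumptions introduced for this sketch; the
report does not specify how "occurs widely" or "has impact" would be
measured, and the example entries are placeholders, not survey results.

```python
from dataclasses import dataclass


@dataclass
class AssessmentType:
    """A hypothetical record for one type of test or assessment practice."""
    name: str
    occurs_widely: bool      # assumed flag: administered generally and frequently
    has_great_impact: bool   # assumed flag: significantly influences students/teachers


def map_cell(a: AssessmentType) -> int:
    """Return the Figure 1 cell number (1-4) for an assessment type."""
    if a.has_great_impact:
        return 1 if a.occurs_widely else 2
    return 3 if a.occurs_widely else 4


# Reading of each cell, paraphrased from the surrounding text.
CELL_READING = {
    1: "occurs widely, has great impact (matters most)",
    2: "does not occur widely, has great impact",
    3: "occurs widely, has little impact (costs may exceed benefits)",
    4: "does not occur widely, has little impact (matters least)",
}

if __name__ == "__main__":
    # Placeholder entries only; these are not findings from the survey.
    examples = [
        AssessmentType("type A", occurs_widely=True, has_great_impact=True),
        AssessmentType("type B", occurs_widely=True, has_great_impact=False),
    ]
    for a in examples:
        cell = map_cell(a)
        print(f"{a.name}: Cell {cell}, {CELL_READING[cell]}")
```

In practice each flag would have to be derived from survey responses by
some threshold or rating rule; that rule is left open here because the
text does not define one.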

The second research problem for the national survey, as noted
above, was to identify and describe in what circumstances particular
types of assessment activity matter. The survey would, therefore, seek
data so that the descriptive "map" in Figure 1 could be differentiated,
so that patterns of test use and impact under different contextual
conditions could be described. Types of tests may occur with a frequency
and/or degree and type of impact that varies from urban schools to rural,
from schools serving the economically advantaged to those serving the
economically disadvantaged, from classrooms where teachers are more
experienced to those where they are less so. Achieving a differentiated
description of testing and test use of this kind can result in the
identification of the factors that influence the use and impacts of
particular kinds of tests and other assessment practices. Consequently,
the description should afford an understanding of conditions that
contribute to optimal use of particular kinds of tests and other assess-
ment procedures.

Summary. Earlier decisions led the Test Use Project's national
survey to focus upon:

• Achievement testing in language arts/reading and
  mathematics at the upper elementary and high school
  grade levels.

• Test uses of the first- and second-order (Baker, 1978),
  i.e., the uses of testing within schools.

• Information on the latter as reported by classroom
  teachers and principals.



More specifically, the national survey would gather basic
descriptive information on:

• The frequency and distribution of a broad range of
  types of achievement testing and other achievement
  assessment practices.

• The impacts of those types of testing and practices,
  i.e.,

  - the particular purposes for which test scores
    and other assessment results are used and
    their importance in serving those purposes,

  - the influences those types of testing and assess-
    ment practices have by virtue of their very
    presence as required or recommended activities.

• The combinations of factors that influence the uses and
  impacts of particular types of achievement tests and other
  achievement assessment practices.

Patterns of responses to survey questions on the above issues (as
seen later in this document) will help provide basic data on the benefits
that accrue for students and practitioners from achievement
tests and assessment practices. Negative impacts cited by respondents
will help to formulate some of the costs of testing. In Phase II of
the Test Use Project, when follow-up field research occurs, the monetary,
opportunity, and psychological costs of testing will be the focus of
inquiry. The project's exploratory fieldwork confirmed the wisdom of
this earlier decision. Even when using interviews in the field, checking
cost information was extremely difficult.



At this point in our planning activities, we began to approach
the question of study design and data collection, and selection of a
research model most appropriate to our endeavors.

The Research Model Guiding the Test Use Project

One end-result of our planning activities was the selection of the
central questions which would guide the national survey. These questions
were stated as follows:

1. With what frequency and distribution are particular types
   of test given in the upper elementary grades and high
   school?

2. In what ways do particular types of tests and testing
   impact upon schools and those within them:

   a. through their very presence, required or recommended?

   b. through utilization of their results?

3. What factors influence:

   a. where and how much particular types of testing are done?

   b. the ways types of tests, testing, and test score use
      impact upon schools and those within them?

As will be recalled from our 1980 Test Use report to the NIE,
since our survey was intended to be both descriptive and analytic, we
were concerned that our research meet the canons of descriptive validity.
In selecting a theory of the nature of the phenomenon to be described,
the researcher imposes a reality, consisting of a set of constructs and
statements of relationship and function, on the phenomenon being
described. This imposition of reality occurs as the researcher attempts
to describe the activities and events that are taking place and the
manner in which they are taking place. In fact, when he/she makes
decisions about what to select for description and what to omit, the
researcher imposes his/her own structure on what is "really" taking
place. It is critical, then, that the researcher's constructs and
assumed relationships bear a resemblance to those which participants in
the phenomenon being described actually act upon. In brief, this means
that the researcher's description of the phenomenon being studied
should attempt to integrate both the researcher's and the participants'
orientations and conceptualizations. Thus, our survey design proceeded
from a conceptual standpoint that maintained contact with the orientations
and purposes of educators in schools and at the same time addressed
our own central policy and research concerns.

As discussed in our 1980 report to the NIE, the study's conceptualiza-
tion involved two interlinking concepts: that of the teacher as
practical reasoner and decision maker, and that of testing as an inter-
vention.

As practical reasoners and decision makers, teachers orient their
activities to the practical tasks they must accomplish in their every-
day routines and do so in light of the practical contingencies and
exigencies they face. Teachers, further, carry out their activities
based on their understanding of a "world known in common and taken for
granted" (Schutz, 1962). Our planning stage activities can be interpreted
from this perspective. That teachers orient their efforts to the practical
tasks that are central to their everyday lives and that they do orient
to their practical exigencies was recurrently documented in data
gathered during our planning activities. Further, teachers rely on
consensually-supported and phenomenologically-based understanding to
carry out their tasks.



In our 1980 annual report to the NIE, we cited evidence from our
fieldwork demonstrating that:

• Teachers report their uses of test results as serving most
  heavily the functions that are at the core of teaching-as-
  practiced.

• The means of assessment that teachers report using most
  often and in the greatest variety of ways are those which
  facilitate the accomplishment of their practical activities
  under the exigencies they face.

• Teachers tend to use least those tests which fit least
  well with the practical demands of their everyday world.

• For given activities and decisions, teachers most often use
  the results of various types of assessment techniques collec-
  tively. Scores from one test or one type of test rarely
  serve alone as the basis for accomplishing a task.

• Teachers orient to the routine constitutive tasks and
  exigencies of teaching-as-practiced.

The second concept framing our project's survey inquiry was the
concept of testing as an intervention. That is, whether required or
recommended, tests, by virtue of their very presence in the teacher's
world, can function as educational change agents. Our planning stage
fieldwork suggested that tests can function as such in any one of three
ways:



• Mandated tests can add new standards of accountability to
  the practical exigencies teachers must attend to in their
  everyday routines.

• Mandated tests can change the practical circumstances under
  which teaching and learning must be accomplished.

• Testing programs of particular kinds can facilitate accomplishment
  of the routine tasks of teaching-as-practiced by
  responding to the practical exigencies teachers face.

(Evidence supporting these findings, though previously discussed in our
reports to the NIE, will once again be summarized in a subsequent section
of this report. That section, in addition to summarizing the fieldwork,
will also discuss CSE test use data preceding the fieldwork and the
findings of the Test Use Project's national survey.)

In the foregoing discussion we have outlined the concepts of
teacher as practical decision maker and testing as an intervention.
These concepts served to orient the design of our national survey. The
two concepts converge to provide a grounded theory of test use in
schools and classrooms. It is a theory taking into account the purposes
and constructs of participants in the phenomenon under examination, and
it is a theory which permits issues to be addressed that are central to
policymakers, stakeholders in the testing enterprise, and the community
of researchers studying educational testing.

This theory of test use provided a heuristic for the informed
selection of domains to be examined in our survey research and indicated
some relationships for study among those domains. The domains were
concerned with the following:

    Federal/state/local testing requirements
    Federal/state/local instructional programs
    Organization of curriculum and instruction
    Types of students served
    Teachers' perceptions of the utility of tests and types of tests
    Teachers' experience and training
    District and local site leadership action
    Types of tests given: purposes and frequency
    Types of test score use
    Impacts of tests






The CSE national survey findings reflect the above kinds of domains. The
reader who wishes to examine these findings is invited to resume reading
of this report at page 65, where the selection of the national sample,
the development of instrumentation, and discussion of its findings begins.
The reader who is interested in all of CSE's test use findings, those
which led up to as well as those stemming from the national survey,
should continue below in the section dealing with our 1978 study.






FINDINGS FROM THE 1978 STUDY

As mentioned previously, one of CSE's early activities in gathering
information on teachers' test practices and test use began in 1978. Two
hundred sixty teachers participated in this small-scale study, representing
20 California elementary schools in urban, rural, and suburban areas
and in lower and higher socioeconomic communities. The results of these
teachers' reports gave some preliminary answers to our questions about:

• The volume of testing occurring in schools.
• The extent to which teachers use test results.
• Teachers' knowledge of and attitude toward tests.
• Factors influencing the use of tests.

The Volume of Testing Occurring in Schools 

All schools in the study administered yearly state assessment tests
in grades one, two, three, and six, and all administered annual or
semi-annual standardized norm-referenced test batteries to their students. A
sizeable number were required, in addition, to give beginning and end of
year assessments of a criterion-referenced or district continuum variety.
As with all California schools, the schools in the study were involved in
required minimum competency testing. While this listing of required tests
is sizeable, it is not exhaustive. Other kinds of tests, teachers reported,
constituted a much greater proportion of assessment activities in schools.

One of the survey questions addressed those tests administered routinely
by classroom teachers in their normal instructional activities. Teachers
reported more frequent testing in mathematics than in reading, but the
frequency in both subject areas was substantial. A majority of the
teachers reported giving weekly or daily mathematics tests, and eighty
percent reported at least monthly mathematics testing. About one-third
of the teachers administered weekly reading tests, and another third
reported monthly reading tests. Testing in both subject areas was less
frequent in the primary grades than in the upper elementary levels.

The Extent to Which Teachers Use Test Results

The survey investigated use from two perspectives: first, what sources
of information were used to make particular instructional decisions; and
second, what use was made of test results? The first perspective inquired
about the use of tests relative to other sources of available information;
the second asked more directly about the use of particular types of tests,
but gave a more limited sense of relative value.



Teachers were asked what sources of information they used most
frequently at the beginning of the school year to assess student skills.
Fifty-eight percent reported that test results were most important for
initial reading placement, and 66 percent reported using test results most
often for initial mathematics placement.

While these findings implied that test results, and even those from
required tests, provided important information early in the year, the
picture changed as school got underway. When asked the sources of information
they used to assess student progress throughout the year, teachers reported
relying most heavily on interactions with students, informal assessments
(e.g., oral quizzes, reading aloud), and the results of teacher-developed
tests. It seemed that the results of standardized tests were rarely used,
and that curriculum-embedded tests fared only slightly better.

Test results, then, seemed to provide the teacher with a quick and
acceptable estimate of the ability of new students with whom the teacher
was unfamiliar. However, once initial placements were made and teachers
became more acquainted with their students, they stated that they were less
likely to rely on standardized or curriculum tests as information about
students' progress.

A similar picture emerged when teachers were asked more directly about
how they use the results from their own tests and from required tests.
Teachers indicated that they usually used the results of their own tests
for several purposes: to make instructional decisions, to evaluate the
effectiveness of their classroom program (e.g., teaching strategies,
curriculum materials), and to provide information to others (e.g., parents,
other teachers). Teachers also reported using tests to assign grades, but with
somewhat less frequency.

In contrast, teachers stated that they used the results from required
tests only infrequently for any of the above purposes. They seemed to use
these tests relatively most often for reporting to parents or other staff
and for evaluating the effectiveness of teaching methods and materials,
but their reported frequencies were quite low. Required test results seemed
to function for teachers as a standard of comparison, while teacher-made
tests reportedly were used more for instructional decision making.




Teachers' Knowledge of and Attitude Toward Tests

Most teachers reported some training, e.g., college courses and
inservice sessions, in educational measurement. Thirty-nine percent reported
two or more college courses related to educational testing, while 23
percent reported no college courses in this area. A majority also reported
at least one inservice course in testing.

Despite this formal training in testing, however, teachers' responses
about appropriate interpretations of common standardized test scores raised
some questions about their levels of understanding. When presented with
test results, only 50 percent of the teachers were able to interpret
correctly percentile and grade equivalent scores, the two methods most
frequently used for reporting standardized test scores.

Survey data about teachers' attitudes toward required testing were
more consistent. Responses about how teachers evaluated the costs vs.
the benefits of testing, their reactions to discontinuing required
testing, and their opinions of what required tests measure portrayed a
somewhat negative picture.

When asked to rate the amount of classroom time spent in required
testing relative to the teacher and student benefits which accrued,
teachers felt that a bit too much time was spent in testing. Similarly,
they responded that teachers would react favorably to the discontinuation
of testing, though again their responses were not extreme. Finally,
teachers stated that they felt that their students' performance on
required tests was influenced to some extent by the instruction they received,
but they stated that they believed students' motivation, test-taking
skills, unusual circumstances, and test quality were more important
factors.

Factors Influencing the Use of Test Results

Two lines of inquiry suggested factors which influence the use of
tests by teachers. First, teachers were asked what features they
considered in formulating their own classroom testing programs. As stated
in an earlier section of this report, we assumed that the more tests
exemplified desired features, the greater the likelihood they would be used.
A second avenue of inquiry was more empirical: what contextual variables
were associated with more test use? The variables examined included
teaching experience, classroom organization (team teaching vs.
self-contained), grade level taught, and availability of classroom aides.

What test qualities were most important to teachers? Teachers reported
that clear format, similarity to class material, and accurate prediction of
achievement were the qualities they considered most important when choosing
prepared tests. Similarly, when asked why they developed their own tests
rather than using commercial tests, teachers cited suitability for their
students and sensitivity to classroom instruction as critical reasons. They
stated that lack of funds, of time to order tests, or of information about
tests were unimportant influences. Intuitive validity appeared to be the
essential feature for teachers: did the test match what was taught and did
it provide a suitable context so that students could exhibit their skills?
This criterion contrasts with teachers' perceptions of required tests as being
heavily influenced by students' test-taking skills and other extraneous
influences.

What contextual factors seemed to be associated with test use? Certainly,
grade level appeared to exert a significant influence. Primary
grade teachers reported that they administered fewer tests, that they were
less likely to develop their own tests, and that they would react more
positively to abolishing required tests than would upper elementary school
teachers.

Years of teaching experience was also related to different patterns
of test use. Younger teachers, i.e., those with less than eight years
of teaching experience, appeared more skeptical of testing. These teachers,
relative to their more experienced peers, appeared more likely to use their
own tests and other less formal methods (e.g., work assignments, informal
quizzes, students' place in the text) to assess student progress, and less
likely to use the results of required, standardized, or curriculum-embedded
tests. They were also apparently less optimistic about the extent to which
instruction influences students' performance on required tests, an opinion
consistent with their reported behavior. Perhaps these younger teachers
were influenced during their preservice training by relatively recent
criterion-referenced testing methodologies, and were, therefore, more
suspicious of published tests.

The presence of aides was also associated with more frequent use of
assessment data. Teachers with classroom aides, compared with those without
such assistance, reported greater use of curriculum-embedded tests and
used students' place in their book and other informal assessments more
often to monitor their students' progress. It may be that teachers felt
that considerable record keeping was required to make good use of test
data for instructional decision making, that a classroom aide would ease
significantly the burden in this area, and thus might be instrumental to
a teacher's use of test data. Further, the teachers might have been
concerned about using test data for instructional decision making to identify
and better meet individual needs. The availability of aides might make
teachers feel that they then have more time to prescribe alternative
settings for instruction, e.g., aides can give tutorial assistance, supervise
small student groups, etc. Without instructional alternatives, however,
teachers might feel less motivation to use test data, because they lack
the resources to carry out more individualized prescriptions and/or needed
remediations. Consistent with this hypothesis, teachers with aides appeared
less likely to allow failing students to progress to the next instructional
unit, and more likely to provide such students with remedial help, e.g.,
tutoring and additional practice.

Summary 

The findings of CSE's 1978 study replicated those of other researchers:
Teachers in the sample reported that they do not make much use of the many
standardized tests they are required to administer. Furthermore, while
they perhaps were not adamantly hostile in the face of required testing,
their attitudes towards these tests, at best, were reserved. These
attitudes may explain [illegible]. Teachers' knowledge
in testing, no doubt, was also a contributing factor.




The teachers reported that required standardized tests comprised only
a small fraction of classroom assessment activities. Curriculum-embedded
tests and particularly teacher-made tests were not only more prevalent,
apparently, but played a larger role in instructional decision making.
These kinds of tests apparently had considerably more validity for the
teachers in terms of their suitability for students and their curriculum
coverage, two prime criteria for teachers.

What other factors contributed to the teachers' use of tests? Grade
level, consistent with other studies, was an important factor (see Goslin,
1965; Yeh, 1978). Less testing went on in the primary grades in the sample.
More interesting, however, was the finding that the availability of
classroom aides was associated with greater use of tests. It was hypothesized
that aides provided a support function for the teachers, both in record
keeping and in making possible instructional alternatives, that enabled
teachers to use test results for decision making and to implement those
decisions. During the 1978 study we saw the potential importance of making
sufficient resources available to teachers to implement any new idea, and we
saw that the systematic use of test data to improve instruction, in 1978, was
a relatively new idea.

Adequate knowledge and training in the use of tests, it appeared in
1978, were necessary resources. The survey indicated that most training
related to testing occurred during preservice education. Thus, while
younger teachers might have been exposed to newer approaches to testing,
many older teachers perhaps had not. Given, in addition, the questions
the survey raised about the efficacy of teachers' training in testing,
the need for additional staff development activities seemed quite clear
on the basis of our 1978 study.

The data from this study had an early bearing on the direction that 
our present investigation of test use would take. 






THE EXPLORATORY FIELDWORK 



Intentions 

The fieldwork was intended, in conjunction with other Phase I planning
activities, to serve two purposes:

(1) To help refine and focus the conceptual framework
    and research questions guiding the three-year study.

(2) To inform construction of survey instruments to be used
    for collecting data from the national sample of teachers
    and principals.

The on-site fieldwork was designed to address such questions as:
the range of ways teachers and others in schools seem to have for assessing
students' performance and progress; the range of purposes that
assessment results (test scores and other information) seem to serve; the
kinds of assessment and uses of results that seem most pervasive and most
influential for curriculum and instruction; the factors seeming to impact
on assessment practices and uses most significantly; the relationships
among those factors; and the adequacy of the study's conceptual framework.
The fieldwork was aimed to provide information in response to such
questions as these and so assist in refining and focusing the survey design.

The fieldwork was simultaneously intended to inform the construction
of instruments for the national survey. The exploratory effort undertook
to discover whether educators in the schools visited would find important
study issues too complex, too ineffable, or otherwise too difficult to
address succinctly, simply, and at the same time (from their point of view)
accurately. Attention was also given to the kinds of questioning
strategies and forms that seemed to bring the clearest, most precise responses
to particular issues. The fieldwork also sought to identify what types
(or role categories) of practitioners in schools were likely to be best
informed on certain factual matters, e.g., have complete information on
the school-wide testing program, know who requires that particular tests
be given, etc. More fundamentally, the fieldwork aimed to comprehend as
fully as possible the ways teachers and others think about and talk about
the evaluation of student achievement, their instructional decisions and
practices, and other matters into which the survey would inquire. In so
doing, the exploratory work strived to provide data so that the language
and concepts of the survey could be aligned with language and concepts
through which teachers and principals organize their experience; that is,
one of our prime concerns at this stage in the project reflected the issue
of validity previously described, in which integration of the conceptual
schemes of researcher and participant is critical.

Following from the purposes and objectives outlined above, the
fieldwork was oriented to explore issues related to the following questions:

1. What Kinds of Tests and Other Assessment Techniques are
   Administered?

2. What Purposes are Particular Kinds of Tests and Other Techniques
   Intended to Serve by Those Who Require Them?

3. What are the Features of the Social Contexts in Which Various
   Kinds of Tests and Other Techniques Occur?
   (including staff members' attitudes, perspectives, and reasoning
   on student assessment, their levels of experience and training,
   demographic characteristics of the school enrollment, etc.)

4. What Are the Features of the Organizational Contexts in Which
   Various Kinds of Techniques Occur?
   (including leadership actions, in-service programs, organization
   of instruction, etc.)

5. How and By Whom are the Results of Various Kinds of Tests and
   Other Techniques Actually Used?

Sites were selected for fieldwork in terms of the following
criteria:

1. Diversity in Required Testing Program

2. Geographic/Regional and Demographic Diversity
   (including diversity of ethnicity, socioeconomic status,
   and first language among the students served)

3. Variation in District Size and Resources

4. Variation in Local Instructional Programs

5. Variation in Reputed Skill and "Sophistication" in Test Use

6. Accessibility within Budget Limitations

Phone contacts to gather appropriate selection information were made
with persons familiar with state testing programs, with the salience of
testing in different regions of the United States, and with local district
activities. A set of "interesting" districts was thus identified. Then,
using a standard telephone protocol, information was gathered from
officials in these "nominated" districts on district and school activities.
On the basis of these calls, three districts were chosen.

Three schools for site visits were identified in each district with
the assistance of district personnel. During this process, an effort
was made to locate a roughly balanced number of elementary and high
schools, schools serving higher and lower socioeconomic populations, schools
with more traditional programs, and schools with more innovative instructional
programs.






Exploratory field data were gathered primarily by interview. A
detailed description of the interview forms and procedures followed
appears in the Test Use Project Annual Report to the NIE, 1980. In brief,
two forms of an interview schedule were used. We were concerned, first,
with the need to balance the conceptual schemes of researcher and
participant. Second, we were equally concerned with minimizing biases that might
stem from the questions asked by the researcher or from the kinds of
answers offered by respondents. Therefore, one form of the interview was
deliberately direct and addressed matters of "testing." The second form of
the measure worked by the method of indirection, and addressed matters of
"information teachers use for classroom decisions." Interviews averaged
45 minutes in length. They were conducted in three school districts (one
in the Northeast, one in the Midwest, and one in the Southwest) and nine
schools with respondents in the following roles:

    Principals                                 7
    Vice Principals                            3
    Department Chairs                          8
    Counselors                                 6
    Classroom Teachers                        44
    Specialists                                7
    District Administrators                    4
    Member of Intermediate Education Agency    1

    Total                                     80

The results of the fieldwork are summarized here in two forms.
First, findings across the districts and schools are presented. These
findings primarily address study questions 1, 2, and 5 (see pages 25-30),
which were concerned with tests administered, intended purposes, and
test users. Second, descriptive narratives of each district and school
are provided. These narratives, which primarily relate to study questions
3, 4, and 5, which were concerned with social and organizational contexts
of testing and test users, are intended to provide an interpretive and
contextual background against which to view the findings reflecting test
administration and purpose.

General Findings 

Across the nine schools in the three districts visited, a wide range
of assessment techniques was evident. It is important to note, at the
outset, that respondents referenced these almost always by their proper names
or by vernacular variants of proper names. That is, they rarely talked
about "norm-referenced tests," "criterion-referenced tests," "objectives
based tests," "curriculum-embedded tests," etc. Instead, they spoke about
"the Ginn placement," "the CTBS," "the Key Math," "that state matrix test,"
the "Sucher-Allred," and so on. When respondents did refer to kinds of
tests, most often they gave them functional class names, e.g., "diagnostic
tests," "placement tests," "pre-tests," "unit tests," "semester finals,"
"the competency tests." Exceptions were "standardized tests," "minimum
competency tests," and "district tests" (or the "district testing program,"
which referred to district-developed, continuum-of-objectives-based
measures in the particular sites visited).

These observations are important in that they had obvious implications
for our survey instrument development. But they are also noted here to
call attention to the fact that the typology of tests and other techniques
used in this report is one developed by the researchers using categories
salient to the practitioners interviewed.

As expected, a wide range of assessment techniques was reported by
the teachers from the nine schools. These 44 teachers (22 elementary and
22 secondary) collectively mentioned the use of eight categories of
assessment devices for a total of 351 citations, which is more than likely a
low approximation of the actual amount. The assessment categories as well
as the number of citations of assessments in that category (in parentheses)
follow: Standardized tests (43), Curriculum-embedded tests (63), District
objective-based tests (19), Minimum competency tests (12), School,
departmental, and/or grade-level tests (17), Teacher-constructed tests (101),
Diagnostic instruments (11), and Other evaluation techniques (75). The
"other" category included such techniques as homework, worksheets,
conferences, book reports, discussions, observations, etc.
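
The citation counts listed above can be tallied and ranked with a short script. The following is only an illustrative sketch: the dictionary keys and the `share` helper are names introduced here, not part of the study. Note that the eight counts as they appear in this copy sum to 341 rather than the 351 stated in the text, so the shares below are computed against the tallied total; which figure is affected cannot be determined from this copy.

```python
# Citation counts per assessment category, copied from the paragraph above.
citations = {
    "Standardized tests": 43,
    "Curriculum-embedded tests": 63,
    "District objective-based tests": 19,
    "Minimum competency tests": 12,
    "School, departmental, and/or grade-level tests": 17,
    "Teacher-constructed tests": 101,
    "Diagnostic instruments": 11,
    "Other evaluation techniques": 75,
}

# Sum of the eight category counts as printed in this copy.
total = sum(citations.values())


def share(category):
    """Return a category's share of all tallied citations, as a percentage."""
    return 100.0 * citations[category] / total


# Categories ordered from most to least frequently cited.
ranked = sorted(citations, key=citations.get, reverse=True)

if __name__ == "__main__":
    for name in ranked:
        print(f"{name:48s} {citations[name]:4d} {share(name):5.1f}%")
    print(f"{'Tallied total':48s} {total:4d}")
```

Ranking the categories this way makes the pattern discussed next easy to see: teacher-constructed tests and "other" evaluation techniques head the list.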

As can be seen from the above frequencies, teacher-constructed tests
and "other" evaluation techniques were cited most often by the teachers
interviewed, a finding which is fairly consonant with Yeh's (1976)
conclusion that curriculum-embedded tests and teacher-made tests are used to
a much greater degree than standardized tests, but that despite the high
frequency of testing, teachers are more likely to use personal observations and
interactions with students than test results to assess students' progress.
This latter point was not reflected in the frequencies given above, but it
is possible that many of the teachers, and especially those at the
elementary level, failed to mention many of the informal assessment activities
that occur because they are used so frequently and are so much an integral
part of the teaching process. This possibility influenced the manner in
which we conceived and phrased items on the survey instrument so that the
subject of informal assessment could be explored further.

The amount of time these assessment techniques take to prepare,
administer, and/or grade was also explored. Again, as expected, a wide
range of time spent on evaluation in the classroom was reported by the
elementary and secondary teachers interviewed. However, on pursuing
this matter it became apparent that teachers experienced difficulty in
providing an exact estimate of time indices. This was due to a variety
of reasons. For one, some teachers could simply not remember how long
the tests took. More commonly, it was discovered that teachers allowed
different students varying lengths of time to finish the tests and thus
found it difficult to average the time amounts for all students. When
asked about the informal techniques they used, teachers found it next to
impossible to estimate the time they spent, as many of the techniques were
ongoing and/or overlapping.

Although the aforementioned difficulties were encountered during
the interviewing process, the teachers' reports gave some indication of
the time devoted to evaluation. The teachers tended to be conservative
in their estimates, and when ranges of time were given for a particular
assessment technique, we selected the midpoint of this time frame for
analysis purposes.
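
The midpoint rule described above can be sketched in a few lines. This is a minimal illustration only: the function name `minutes_estimate` and the sample reports are hypothetical and do not come from the study's data or procedures.

```python
def minutes_estimate(report):
    """Collapse a reported time into a single value in minutes.

    `report` is either a single number or a (low, high) pair. A pair is
    replaced by its midpoint, mirroring the rule described in the text.
    """
    if isinstance(report, (tuple, list)):
        low, high = report
        return (low + high) / 2
    return float(report)


# Hypothetical reports from one teacher; the values are illustrative only.
reports = {
    "unit test": (30, 50),       # teacher gave a range of 30 to 50 minutes
    "weekly quiz": 15,           # teacher gave a single figure
    "placement test": (40, 60),  # teacher gave a range of 40 to 60 minutes
}

# One point estimate per assessment technique, ready for averaging.
estimates = {name: minutes_estimate(r) for name, r in reports.items()}
```

Using the midpoint treats each reported range as symmetric around its center, which is a simplifying assumption when, as noted above, teachers tended to be conservative in their estimates.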

The analysis of the data showed that the 22 elementary teachers
interviewed spent an average of approximately [illegible] percent of their reading
and math instructional/class time assessing their students. The 22
secondary teachers reported that about 24 percent of their English and math
class time was spent on evaluation. The range in the proportion of total
classroom time given over to assessment was quite large for both the elementary
and secondary teachers: one to 64 percent for elementary and six to 75
percent for secondary.

At first glance it appeared on the average that the secondary teachers
spent more time assessing their students than the elementary teachers.
However, when looking at the responses concerning the types of assessments
given, the vast majority of the secondary teachers' responses were for
formal pencil-and-paper tests. Perhaps more formal testing is occurring
at the secondary level than at the elementary grades because of the ages
of the students involved and because the secondary teacher has less time
for the use of informal techniques and/or observations. As the elementary
teacher usually spends the full school day with the same group of students,
he/she has more opportunities for informal evaluations and less need for
the more formal ones. Also, because the informal techniques were not cited
by the teachers as frequently as the more formal ones, the difference in
the percentages of time allotted to evaluation by the two sets of teachers
was quite large.

The analysis also showed similar results for the total amount of time
the teachers spent on evaluation. This total time includes the preparation,
administration, and grading of tests/assessments. The elementary teachers
reported on the average that 15 percent of their time (which includes
instructional and non-instructional/preparation time) was spent on
assessment, while the secondary teachers spent 34 percent of their time on the
same. The ranges reported by the elementary and secondary teachers were
three to 56 percent and nine to 69 percent, respectively. Again, teachers'
tendency not to report informal assessments and the use of many more formal
evaluation techniques at the secondary level may account for some of the
difference in the amount of time spent on assessment in elementary and
secondary classrooms.

Range of Tests Administered

Fieldwork indicated that a wide range of tests were being administered.
For example, standardized tests, such as the Comprehensive Tests of Basic
Skills (CTBS), the Metropolitan Achievement Test (MAT), Iowa Test of
Basic Skills and of Educational Development (ITBS/ITED), etc., were
administered in each school district visited.

Curriculum-embedded tests of various types were also given everywhere,
but almost exclusively at the elementary grade levels. Most of the
curriculum-embedded tests accompanied commercially produced, elementary-grade
series in math and reading. Among those given frequently were placement tests;
the "unit" or "criterion" tests designed to assess achievement on a specific
portion of the curriculum; and "end of the book" tests (i.e., those the
student took at the completion of a given reading or math "level").

Minimum competency tests were given in two of the districts. In one
case they were district-developed and included four separate instruments
assessing fundamental math skills and four assessing skills in the language
arts. These tests were given at the high school level and passage of all
eight was required for graduation. In the second district, an instrument
developed by the state for administration to ninth grade students included
the general domains of reading, mathematics, and writing. Its function
was only diagnostic.

A statewide assessment measure was given annually in one district
to a matrix sampling of students at certain elementary and high school
levels. Individual student scores were not reported to schools, but
aggregations by grade level, school, and district were provided on various
subskills in reading, mathematics, and writing.

District tests, district-constructed and mandated for use district-wide,
were part of the assessment picture in two of the three districts
visited.

School-, departmental-, and/or grade-level tests were found in five
school sites. One high school, for instance, had just developed and
administered a writing sample in all grade levels. Departments in several
high schools had teacher-developed mid-terms and finals for particular
courses. And in two elementary schools in one of the districts, teams of
teachers at particular grade levels constructed and gave common tests keyed to
their social studies curriculum.

Diagnostic instruments were also employed, but largely by specialists
such as remedial reading instructors, teachers of the "learning
disabled" and "emotionally handicapped," and Title I program staff members.
Almost all of these were found in elementary schools.

Teacher-constructed tests, quizzes, and the like were, of course,
extant in every site.

Other measures of student achievement were also prevalent in all
classrooms. In the elementary grades, students' daily worksheets,
classroom performance, along with homework and other assignments, were
mentioned as ways of evaluating students' progress. These same types of
"measures" were among those used by high school teachers. The latter
also cited conferences with students, peer evaluation of classroom reports,
oral quizzes and question-answer sessions, group discussions, and a wide
variety of written assignments as assessment techniques.

The specific configuration of tests being administered in each of 
the districts visited is provided in the district narratives. 

Range of Reported Uses

Distinct patterns of use also grew out of fieldwork analysis, which
suggested that test scores and other assessment results were used for a
finite number of purposes across the sites visited. At the classroom level,
there was little school-to-school or district-to-district variation in
the range of uses respondents reported. Eleven types of uses for
assessment information were inductively derivable from the specific comments
of educators interviewed. Recall that the uses listed below are those which
individual respondents said they themselves made of test scores and other
student assessment "data."

(1) Referral to and/or placement in special programs,
    appropriate classes, appropriate "tracks," etc.

(2) Within-classroom placement of students at appropriate levels
    in individualized programs, in reading or math groups, in
    occasional, temporary skills remediation groups, etc.

(3) Planning instruction: "figuring out my class' strengths,"
    "learning what the group needs," "getting feedback so I know
    what we have to go over again," "working with one of my
    grade-level groups of teachers to decide what areas they need to
    strengthen," etc.

(4) Monitoring students' progress, "seeing how they're doing as
    we go along," "just getting a sense of whether they're
    learning anything."

(5) Holding students accountable for doing assigned work,
    maintaining class discipline.

(6) Assigning report card grades.

(7) Certifying students' competency for promotion, high school
    graduation.

(8) Counseling and advising students about how they are doing,
    about their preparation for future courses and academic
    goals, about their achievement, motivation potential, etc.

(9) Informing parents of how their children are doing in regularly
    scheduled conferences, at "back-to-school" nights, special
    meetings, when problems arise.

(10) Reporting to higher organizational levels within the district
     --to the principal, district office, the school board--on
     student achievement.

(11) Comparing groups of students with others, judging how a class,
     school or district is performing relative to others.

Patterns of Assessment Results Use 

From the respondents' comments about how they used the results of
particular tests and other assessments we developed a coding scheme to
index the importance of particular results for particular purposes. This
simple scheme depicted the use of a score or result for a given purpose
as: (1) the sole information source used; (2) one of two or three major
sources; (3) one of many sources; (4) a verification source, i.e., used
ancillarily to check decisions or conclusions already reached based on
other information sources; and (5) not used, simply administered.

Interview data from the 44 classroom teachers included 330 descriptions
of how the results of particular types of assessment were used.* They
also included 21 statements that the respondents did not use results of
types of measures that they administered.

*Redundant uses for different tests of the same type were dropped out in
collapsing the 346 tests/assessment means cited into the eight types of
assessment listed earlier in this section.

As Table 1 indicates, teachers rarely used only one type of
assessment information to make a given decision or accomplish a given purpose.
Only 5.1 percent of the uses cited (including statements of non-use) were
"sole source" uses, i.e., results used alone to make a given decision. In
two-thirds of the cases, results from a particular type of assessment
were used as one among many types of information employed for the particular
purpose at hand.



Table 1

Overall Patterns of Assessment Results Use

                                  Functional Importance

             Sole      One of Several   One of Many   Verification   Not
             Source    Major Sources    Sources       Source         Used     Total

Instances    18        65               237           10             21       351
Mentioned    (5.1%)    (18.5%)          (67.5%)       (2.8%)         (6.0%)

In short, it appeared that teachers were most likely to look at a
variety of different kinds of information as they make the judgments,
analyses, and reports they must make as part of their routine professional
activities.
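
The percentages in Table 1 follow from the raw counts, and the same counts
give the combined shares quoted in the summary later in this section. The
sketch below is a minimal check in Python and is not part of the original
analysis. The count of 65 for "one of several major sources" is an
assumption taken from the Major Source total in Table 2, and the names
`counts` and `share` are illustrative only.

```python
# Counts of cited uses by functional importance (Table 1).
# The value 65 is assumed from the "Major Source" total in Table 2.
counts = {
    "sole source": 18,
    "one of several major sources": 65,
    "one of many sources": 237,
    "verification source": 10,
    "not used": 21,
}

total = sum(counts.values())  # all citations, including statements of non-use


def share(n: int, whole: int = total) -> float:
    """Return n as a percentage of whole."""
    return 100.0 * n / whole


for label, n in counts.items():
    print(f"{label:30s} {n:4d}  {share(n):5.1f}%")

# Combined shares: results weighted heavily vs. results discounted.
heavy = counts["sole source"] + counts["one of several major sources"]
light = counts["verification source"] + counts["not used"]
print(f"sole or major: {share(heavy):.1f}%")
print(f"verification or not used: {share(light):.1f}%")
```

Under these assumptions the five categories sum to 351 citations, and the
two combined shares come to 23.6 and 8.8 percent.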

Test information used as sole and major criteria: If most means of
assessment provide information that is used jointly with others, which
means do seem to provide information that functions as a sole or major
criterion in teachers' activities? Table 2 provides an answer in overview.

Table 2

Types of Tests Used by Teachers
as Sole and Major Sources of Information for Any Purposes

                              Total                                     Total
                              Citations,     Count (Column %)           Sole & Major
Test Type                     all levels*    Sole Source  Major Source  (% total in Table)

Standardized                      43          6 (33.3)      5  (7.7)     11 (13.2)
Curriculum-Embedded               63          5 (27.8)     12 (18.5)     17 (20.5)
District Objective-Based          19          1  (5.6)      6  (9.2)      7  (8.5)
Minimum Competency+               12          0  (0.0)      0  (0.0)      0  (0.0)
Statewide Assessment              10          0  (0.0)      0  (0.0)      0  (0.0)
School/Department/Grade-Level     17          0  (0.0)      9 (13.8)      9 (10.8)
Individual Teacher-Constructed   101          5 (27.8)     15 (23.1)     20 (24.1)
Diagnostic                        11          0  (0.0)      0  (0.0)      0  (0.0)
Other                             75          1  (5.6)     18 (27.7)     19 (22.9)

TOTALS                           351         18            65            83 (100.0)

*Count of all instances in which test type was mentioned as
used in any way, including "not used" category.

+Minimum competency tests were used as the sole source for
deciding whether students graduated from high school in
one district, but this decision was not made by classroom
teachers or other school-level practitioners.

From the above, a picture began to emerge of teachers drawing upon
many types of assessment to do their routine instruction-related work.
And the fieldwork data suggested that the types of assessment they use
most frequently in this routine work tended to be those that are

  • most immediately accessible to teachers and which provide most
    immediate results; those over which they have most control--can
    administer when they choose and can see the results promptly;

  • those which purport to serve functions isomorphic with the tasks
    teachers must routinely do; i.e., curriculum-embedded placement
    tests figure significantly in placement decisions; records of
    progress through a continuum for placement in a continuum; tests
    that teachers design or text publishers produce for measuring
    achievement on a unit of instruction for monitoring progress and
    grading students on that unit, etc.

  • those which teachers deem to "cover" most exactly the content of
    the material they are teaching.

In short, those tests teachers see as linked most closely to the routine
practical activities of their everyday professional lives are those they
use most often. Additionally, the phenomenological evidence of everyday
experience with students plays an important role in teachers' assessments
of them.

The single exception to this generalization appears to occur in the
use of standardized tests. For the most part, teachers used these for
general reference, to get an initial sense of how their new classes "look"
relative to others, or as a normative reference point against which to
gauge progress--except, it seems, when they are required to do otherwise
by district mandate.

Test information that is not used: In 21 instances, teachers said
they did not use the results of one or another type of test that they
gave. Ten teachers mentioned their non-use of standardized test results;
seven mentioned non-use of statewide assessments. In the case of the latter,
teachers had no access to students' individual scores or results aggregated
by class.

The above descriptions began to indicate some of the activities in
which assessment results play a definitive or major role. Table 3
provides a comprehensive picture of the purposes for which they do so.

Table 3

Purposes for Which Teachers Use Various Types of Assessment Results
as Sole and Major Information Sources

Count: Number of Citations

Purposes                                  Sole   Major   Total   (% Table Total)

Planning Instruction                        1      9      10        (12.1%)
Referral/Placement: Special Program         4      5       9        (10.8%)
Within-Class Grouping and
  Individual Placement                      7     18      25        (30.1%)
Holding Students Accountable
  for Work, Discipline                      1      6       7         (8.9%)
Assigning Grades                            0      9       9        (10.8%)
Monitoring Students' Progress               0      6       6         (7.2%)
Counseling and Guiding Students             5      8      13        (15.6%)
Informing Parents                           0      1       1         (1.2%)
Reporting to District Officials,
  School Board, etc.                        0      2       2         (2.4%)
Comparing Groups of Students,
  Schools, etc.                             0      1       1         (1.2%)
*Certifying Minimum Competency              0      0       0         (0.0%)

TOTAL                                      18     65      83

*Note: In one district visited, tests of minimum competency were required
for high school graduation. Respondents, however, took this as obvious
and rarely mentioned that they served in this way. When they did speak of
the uses of minimum competency results, they described their uses for other
purposes.

As Table 3 shows, test scores seemed to play an important role in
student placement decisions. In 40.9 percent of the instances in which
teachers reported that they used assessment results as a sole criterion
or a major criterion, the placement of learners was at issue. The use of
scores as a major basis for in-class placement was especially frequent.
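
The placement share can be recomputed from the sole and major counts in
Table 3. This is a minimal sketch, assuming that the within-class total is
the sum of its sole (7) and major (18) entries. The variable names are
illustrative and not taken from the study.

```python
# (sole, major) citation counts for the two placement purposes in Table 3.
placement = {
    "referral/placement in special programs": (4, 5),
    "within-class grouping and individual placement": (7, 18),
}
table_total = 83  # all sole and major citations in Table 3

# Add up every sole and major citation that concerns placement.
placement_citations = sum(sole + major for sole, major in placement.values())
placement_share = 100.0 * placement_citations / table_total

print(placement_citations, f"{placement_share:.2f}%")
```

This gives 34 of 83 citations, about 40.96 percent, which the text reports
as 40.9 percent.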

Summary. Most often, teachers seemed to consider the results of
several types of assessment collectively in arriving at a particular
decision or carrying out a particular activity. When they reported
departing from this practice, it was more often in the direction of weighing
test scores more heavily than in the direction of counting them less.
(Citations of results as sole and major information sources equaled 23.6
percent of the total; citations of results not being used or used only in
verification equaled 8.8 percent of the total.) The placement of students
seemed to be an activity in which the results of one test or type of test
may count more heavily than in others.

Relationship Between Types of Tests and Categories of Use

Table 4 summarizes the relationships reported by both the elementary
(n=22) and secondary (n=22) classroom teachers interviewed. The table
indicates that the main uses of test and other assessment results are:

  • Planning for instruction

  • Grouping students and placing them at levels of individualized
    programs within classrooms

  • Grading

  • Monitoring students' progress, i.e., keeping track of how they
    are doing over time.

Table 4

Types of Tests and the Uses of Their Results

Each entry gives elementary + secondary = total citations.
Type of Test: Std = Standardized; CE = Curriculum-Embedded; DOB = District
Objective-Based; MC = Minimum Competency; SA = Statewide Assessment;
SDG = School/Department/Grade-Level; TC = Individual Teacher-Constructed;
Dx = Diagnostic; Oth = Other.

Purpose                       Std       CE        DOB       MC        SA        SDG       TC        Dx        Oth       Total

Planning Instruction          9+4=13    8+2=10    3+0=3     1+3=4     2+0=2     1+2=3     11+13=24  1+1=2     13+8=21   49+33=82
Referral/Placement            9+2=11    0         0         0+1=1     0         0+2=2     2+1=3     0         2+4=6     13+10=23
Within-Classroom Grouping &   4+0=4     18+0=18   5+0=5     1+2=3     0+1=1     1+3=4     2+4=6     6+0=6     11+3=14   48+13=61
  Individual Placement
Holding Students Accountable  0         3+0=3     0         0         0         0         4+4=8     0         2+0=2     9+4=13
  for Work, Discipline
Assigning Grades              0+1=1     14+3=17   1+0=1     0+1=1     0         0+5=5     15+17=32  1+0=1     7+1=8     38+28=66
Monitoring Students' Progress 0         14+0=14   4+0=4     0         0         0+2=2     10+8=18   1+0=1     10+2=12   39+12=51
Counseling & Guiding Students 1+2=3     0         2+0=2     0         0         0         2+8=10    1+0=1     4+2=6     10+12=22
Informing Parents             0         0         1+0=1     0         0         0         0         0         1+0=1     2+0=2
Reporting to District         0         1+0=1     2+0=2     0         0         0         0         0         3+0=3     6+0=6
  Officials, School Board, etc.
Comparing Groups of           1+0=1     0         1+0=1     0         0         0         0         0         1+0=1     3+0=3
  Students, Schools, etc.
Certifying Minimum            0         0         0         0+1=1     0         0         0         0         0         0+1=1
  Competency

TOTAL Use Citations           24+9=33   58+5=63   19+0=19   2+8=10    2+1=3     2+14=16   46+55=101 10+1=11   54+20=74  217+113=330

Explicit Statements:          5+5=10    0         0         1+1=2     0+7=7     1+0=1     0         0         0+1=1     7+14=21
  "NOT USED"

Total Citations               29+14=43  58+5=63   19+0=19   3+9=12    2+8=10    3+14=17   46+55=101 10+1=11   54+21=75  224+127=351



Summary. The exploratory fieldwork indicated that the sample teachers
most frequently drew on the results of three types of assessment.
These are (1) their self-constructed tests, quizzes, and written
assignments; (2) other assessment techniques that they devised or chose to
seek out and use, such as class discussions, peer evaluations of work,
conferences with students, talks with their students' previous teachers,
oral reading sessions, etc.; and (3) curriculum-embedded tests--those
that come with district-made curriculum "packages" or commercially
published texts, kits, and the like. They appeared to use each of these
three types especially, but others as well, in accomplishing a variety
of purposes. That is, teachers seemed to refer to each kind of
assessment result for making a variety of judgments, just as they seemed to
make a given decision by referring to a variety of assessment results.
Principals seemed to engage in a similar practice, although the test scores
they used most often and the purposes for which they used them most
frequently differed from those of teachers. All this suggested, of course,
that the national survey should examine patterns of test type/test use
relationships. It should not assume simple one-to-one correspondences
between a test score and a use.

Teachers most frequently cited test scores and other assessment results
as serving them in four activities: planning instruction, grouping and
placing students in a continuum of objectives within the classroom,
assigning grades, and monitoring students' progress over time. Counseling,
guiding, and other uses seemed to follow from the factors previously
discussed.
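
The ranking of activities can be read off the row totals of Table 4. The
sketch below is illustrative only: it uses the elementary and secondary
counts for the eight purposes whose row totals are legible in Table 4, and
the dictionary and variable names are assumptions rather than anything
from the study.

```python
# (elementary, secondary) use citations per purpose, from Table 4 row totals.
use_citations = {
    "planning instruction": (49, 33),
    "referral/placement": (13, 10),
    "within-classroom grouping and placement": (48, 13),
    "holding students accountable": (9, 4),
    "assigning grades": (38, 28),
    "monitoring students' progress": (39, 12),
    "counseling and guiding students": (10, 12),
    "informing parents": (2, 0),
}

# Rank purposes by combined citations, most frequently cited first.
ranked = sorted(use_citations.items(), key=lambda item: sum(item[1]), reverse=True)

for purpose, (elementary, secondary) in ranked:
    print(f"{purpose:42s} {elementary:3d} {secondary:3d} {elementary + secondary:4d}")

# The four most-cited purposes.
top_four = [purpose for purpose, _ in ranked[:4]]
```

With these counts the four most-cited purposes are planning instruction
(82), assigning grades (66), within-classroom grouping and placement (61),
and monitoring students' progress (51), the same four activities named in
the summary.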



A final point is worth noting again. Returning to Table 4, it is
obvious that some activities for which teachers use student assessment
results are relatively "under-mentioned." For instance, conferences with
parents are a routine part of teachers' work, especially at the
elementary school level. A talk with any teacher about his/her students
inevitably includes comparisons with students in other classes or schools,
students in previous years, and so forth. That these activities were
cited relatively infrequently as uses of assessment results was
troublesome to us. In talking with teachers, however, it became evident that
many of the practical tasks for which teachers use test information are, in
fact, "transparent" to them. That is, they are so much a part of
everyday life that they go unnoticed. They are treated, literally, as
unremarkable. That this is so is probably best illustrated by a comment made
by a high school assistant principal in the first district visited, who
explained in the same breath that they did not pay much attention to CTBS
scores in his school because the typical freshman entering the school was
"two years, at least, below grade level."

This should serve as a caveat that Table 4, and the discussion which
has followed from it, is not a complete picture of the frequency with
which the teachers interviewed use test results for certain purposes.
But, given the open-ended nature of the interviews, it is very likely a
comprehensive picture, overall, of the kinds of uses that test and other
assessment results serve.



District Narratives

Content analysis of the taped transcriptions of the nine schools
across the three districts provided information bearing on the social
and organizational contexts in which tests are administered and used.
This analysis suggested that five factors seem to have a bearing on
the atmosphere in which tests are administered, and consequently how
they are valued by teachers and used/not used in classroom decisions.
On the basis of fieldwork, these factors emerged as:

(1) state testing policy and requirements

(2) coherence of school/district testing policy and requirements

(3) leadership in the instructional uses of assessment information

(4) locus of ownership of the assessment program

(5) recognition that no single test can serve (nor is intended
    to serve) the information needs of decision makers who
    reflect a variety of interests from broad program
    accountability to specific classroom practice.

While we had not intended fieldwork to provide a picture of
"exemplary" test use (that would possibly emerge during Phase II of the
project), analysis of responses did suggest a tentative picture of how
contextual factors may converge to make tests appear "usable" (as
previously described on pages 6, 42 of this report). As will be seen later,
the district which seems to have a successful testing program--successful
from the standpoint of reconciling or balancing external testing
requirements with school-level uses of testing--assumes an organizational
posture which has elements of centralism and diffusiveness. The importance
of this observation emerged from our cross-project collaboration. That
is, one of the Test Use Project staff was involved with CSE's Evaluation
Design Project, which has been examining evaluation/testing matters at
the district level. Part of the collaboration involved the production of a
CSE monograph entitled Evaluation in School Districts: Organizational
Perspectives (Bank & Williams, 1981, in press). During this inter-project
work, some of the findings stemming from work at the district
level, and which are discussed in the monograph, took on importance for
an investigation of testing at the school and classroom level.

For example, it is possible that an organization and its
constituent parts can (or perhaps, should) be "loosely-coupled" in
some regards and more tightly coupled in others. This variable posture,
when applied to our fieldwork findings, appears to lend itself to
multiple uses of assessment information: uses which are central and
concerned with external accountability and reporting requirements and uses
which are spread out and reflect the decision needs of individual schools
and classrooms. This is not to suggest that a balance of central authority
and dispersed decision making is the only approach that will lead to
development of a "usable" testing program. But it appears to be the approach
that has evolved, over time, in one of the districts we studied, and it
seems to reflect not only organizational reality, but the careful
determination of various decision needs and specification of an assessment
information system that will meet these needs.

Assessment programs often intend to provide information for use at
local, state, and/or federal policy levels. Often the program will tend
to emphasize the information needs of one of these levels to the
exclusion of the others. Many assessment programs appear to be driven, or
are perceived by the people in them to be driven, more by broad, external
accountability than by concerns for classroom and school-specific
information. (This issue of external "linkages" is also discussed in
Bank & Williams, 1981.) Audiences associated with these external
requirements often ask for assessment information that can be used to compare
educational programs rather than to show the growth of individual pupils
in terms of a specific set of educational objectives. A school system
which tends to respond more to the external audience than to others
frequently relies on the collection and analysis of pupils' scores on a
norm-referenced test. It may be criticized for lack of concern with
individual students and their growth on precise instructional objectives.
A school system tending to respond solely to audiences concerned with
individual student growth in a given classroom (no such system
was discovered in the present study) might tend to rely more on
criterion-referenced or objectives-based tests to provide information
for diagnostic and prescriptive purposes. A school system taking this
position might be subject to questions about the educational significance
of the scores obtained on this kind of test: What do they mean? Do they
show whether the learning that has taken place is important or trivial?
How do the scores obtained on these tests compare with the scores
obtained on other kinds of tests?

A school system might attempt to reconcile both kinds of information
needs, to examine the operant assessment requirements, to investigate their
own assessment needs, to determine which kinds of information will address
the range of needs, to decide which kind of measure is most appropriate
for generating the information addressing a particular decision area,
to specify for its participants the intended uses of various measures,
and thus design a coherent assessment program which is perceived to
have a variety of overlapping uses.

One of the districts we spent time in appears to have developed
this kind of assessment program. The two other districts we visited
seemed to be trying to move in this direction, but still seemed to
be more concerned, or at least their teachers felt they are more
concerned, with external accountability issues.




District One 

This school district, located in the urban Northeast, has 24 elementary
schools (kindergarten to grade 6 primarily; a few are K-8), 2 middle
schools (grades 7-8), and 3 high schools (grades 9-12). Total
enrollment is 27,000, with approximately 50% Black, 30% Hispanic, and 20%
Anglo and other combined. The district has approximately 18 schools
that are Title I eligible.

The state in which this district is located has a minimum
competency testing program which is still in a formative stage of
implementation. While no final determination had been made at the time data
were collected, school district officials did not anticipate that the
proficiency test would become a requirement for high school graduation.
By the provisions of the state requirement, which focuses on "education,
evaluation, and remedial assistance," all 9th graders are tested for
proficiency. Any student scoring below a certain cut-score (established
by the state) must receive remedial assistance from the local school/
district. The state required testing covers the areas of reading/
language arts, mathematics, and also calls for a student writing sample.
Beyond the state required minimum competency testing program, the
district has its own testing program, which is also in a formative stage
of development. This district testing program deals with the areas
of reading and communication arts, and includes the use of a locally
developed criterion-referenced measure. This test is structured by grade,
scope, and sequence, is intended to provide mastery data, and is
administered by teachers and/or reading consultants. It becomes part
of the student's permanent school record and follows him/her from grade to
grade and school to school. District officials anticipate that when this
test has been fully developed, it will become part of the district's
response to the state required minimum competency testing program.

As part of the district's required testing, the Metropolitan
Achievement Test (MAT) is used in grades 2 through 8. It is administered
every spring. At the high school level, the Comprehensive Tests of
Basic Skills (CTBS) is administered in the 11th grade.

The district test, which is accompanied by a specific curriculum,
is supposed to be administered in all schools as part of an attempt to
standardize the curriculum; this was apparently not happening in actual
practice, however.

District Two 

The second district we visited is located in an urban area in the
Southwest. This district has over 100 elementary schools, 20 junior high
schools, and 14 high schools. Total district enrollment is a little
over 100,000.

The state in which this district is located has a required minimum
competency program for high school graduation. Local districts can use
a state developed test or select/develop their own. This district has
developed its own competency program to meet the state requirement.
Among the tests in use in elementary schools are: CTBS; the state
assessment program; the district competency test; and variable use
of a range of curriculum-embedded tests and teacher observation and
classroom interaction. Among the tests in use in the high schools are:
the state assessment program; district competency tests; CTBS;
tests associated with college entrance; and variable use of teacher
constructed measures and classroom observation and interaction.

District Three

The third district visited, which demonstrated multiple and "exemplary"
uses of assessment information, is located in a rural community in the
Midwest. This district has seven elementary schools, three junior
high schools, and one high school. Total district enrollment is a little
over 5,000 students, of whom only 6 percent are minorities.

The state in which this district is located has no required
minimal competency or proficiency testing. The only state requirement is
that districts must identify student needs and set plans to meet
desired levels of achievement.

Among the tests used are the Iowa Tests of Basic Skills (ITBS,
grades 3-8), the Iowa Tests of Educational Development (ITED, grades
9-12), the Cognitive Abilities Tests (CAT, grades 3, 6, and 9),
district/school developed objectives-based tests, and curriculum-embedded
tests.

Schools in this district also enjoy the resources of an Area
Education Agency (AEA). One of the functions of this agency is to
provide technical assistance to schools and individual teachers who
have questions, problems, and needs in testing.

This district differs from the first and second on some important
dimensions. In the third district, the fairly well accepted
district/school developed tests seemed to reduce the amount of time that
teachers spend constructing and administering their own tests (especially at
the elementary schools), thus freeing instructional staff for other
tasks. These locally developed tests are largely seen as complementing
the use of standardized tests, and serving different, though related,
decision needs. In addition, with greater acceptance of district testing
there seemed to be a clearer sense among the teachers of both the
"district" itself as an educational system and its testing policy
and intentions, which teachers did not seem to see as threatening.

Much of the information provided by the respondents seemed to reflect
needs, issues, and concerns about the levels of decisions (Baker, 1978)
that might need to be made on the basis of assessment information. Two
of these levels, 1 and 2, were alluded to previously. Level 1, reflecting
information needs to make decisions about individual students, is of
prime concern among teachers, specialists, and guidance counselors. Level
2, reflecting information needs to make decisions about groups of
students within a school, is also of concern for teachers, but somewhat
more so among department chairpeople, grade level coordinators, and
principals. Level 3, reflecting information needs to make decisions
about groups across schools, is the concern of decision makers at
LEA, SEA, and federal levels, and the general public.




Test Uses/Issues in District One 

In one of the schools in this district, an elementary school,
respondents did not appear to value the district testing program. There
was an impression that the school administration, which had been recently
appointed, was selected to stress the district program and the need for
accountability at the level of the school. Respondents seemed not to see
the purpose nor the relevance of this testing program. They did seem to be
concerned with the kinds of tests available, their match with classroom
curricular concerns, and the instructional unit at which the test
has decision-making relevance. Teachers here were largely concerned
that the tests being used did not seem to match their instructional
concerns and related information needs. They saw little coherence in the
district/school testing policy and expressed little confidence in its
classroom use.

In another elementary school in this district, the school administration
and some of the curriculum and resource specialists seemed to concern
themselves to an extent with accountability (level 3) decisions, but
the teachers did not seem overly concerned with this state of affairs.
It appeared that they not only went about the business of making their
in-class and in-school (level 1 and 2) decisions, but also received
a level of expert assistance in making these decisions that was not
encountered in the first school.

The third school visited in this district was a high school. Perhaps
the most severe problem at the school is the fact that most of its students
do not graduate. In an attempt to specifically pinpoint student deficiencies
and make appropriate curriculum changes, the norm-referenced test being
administered, the CTBS, had not proved useful. There was a
hope among staff that the district testing program (as well as improved
use of department tests) would come to serve as student motivators and
as a means to restructure the curriculum.

District Summary

Several testing issues emerged in this district. First, the state-
required testing program was still in a formative stage. The district
testing program, which responded to state competency testing, was equally
recent. The district program seemed intended not only to serve the
needs for competency testing but also to help standardize the curriculum
district wide. At one school it was seen by teachers as no more
than another accountability measure; if it had some instructional value,
it was not seen by the teachers. In this school, teachers seemed to
have little sense of district, or school, testing policy. Teachers
seemed to feel that required testing served only level 3 decisions; it
helped them not at all with level 1 and level 2 decisions and, indeed,
may get in the way of teachers using measures of their own choice for
these purposes.

In the second school, teachers seldom mentioned the district testing
program. The teachers here perhaps understood the purposes of the
program and so felt less threatened by it. On the other hand, they simply
may not care either way if it does not get in the way of their classroom
activities. One explanation is that concerns of the district testing program
(and level 3 decisions) are seen in this school as the responsibility of
the school administration and specialists. It appeared that these
specialists, some of whom were concerned about the amount of testing
taking place, used the district measure not only for district concerns
but also, where appropriate, to help classroom teachers with their
internal level 1 and level 2 decisions.

In the third school, standardized tests administered in the past
had served no purposes in instructional improvement. There was a
distinct impression that the school was assuming a policy of "wait and
see" in the hope that the new testing program would help them.

In general, the district testing program seemed to suffer from
lack of clear policy and guidelines; in only one of the elementary
schools was there any sense of leadership in the instructional use
of assessment information. It seemed that at the high school a policy
was emerging which may lead to a sense of ownership of the testing
program.



Test Uses/Issues in District Two

In one of the elementary schools in this district, a prime concern
of the teachers was that tests would be used not only to monitor building
progress, but also to evaluate teacher performance. The principal stated
that "if teachers believe they will be evaluated on the basis of test
scores, this is acceptable if that is what is required to achieve
instructional improvement."

In the second school visited, a high school, the impact of minimal
competency testing and the time devoted to this testing has had a profound
influence both on teacher attitude toward required testing and also toward
the uses they make of other kinds of tests.

In the third school visited, also a high school, the impact of
minimal competency testing was felt to be equally high, influencing
not only the amount of testing taking place but also the content of
instruction in the classroom.

District Summary

The advent of minimum competency testing has had an observable
and, from the standpoint of some respondents, a negative effect on
regular classroom instruction and the kinds of resource options made
available to teachers. While the effect seemed to be more pronounced
at the high schools, it also had a bearing on the policies of elementary
schools visited.

In many respects, teacher concern for amount of testing, kinds
of tests administered, and the uses to which they are put echoed the
kinds of responses encountered in the first district visited. This
is especially true with respect to minimal competency testing.

Test Uses/Issues in District Three 

In one of this district's elementary schools, while there were
some teacher-perceived problems with testing, teachers seemed to view
tests as a more useful decision-making tool than was the case in the
first two districts. The test selection/development/use inservice
offered in this district appeared to strongly influence teacher acceptance
and use of test results. Of equal importance, however, are the services
offered by the AEA, a kind of teacher center in which advice and technical
assistance are available and actual tests can be constructed/selected by
teachers.






Another factor appearing to influence teacher use of tests was
the atmosphere in which testing policy is conveyed. The district and
school administration apparently set broad test information
requirements, intended to serve both external accountability and internal
instructional improvement needs, in which departments and teachers have
several options.

One of the respondents in the first school visited described the
history of the district's approach to testing and the role of centralized
training and technical assistance. As a media specialist responsible
for providing "teachers with the materials they need to teach kids,"
several years ago he developed an interest in computer assisted instruction.
His interest in CAI led to using local computer services for test
scoring and data analysis. This led to a district interest in "computer
analysis rather than hand scoring, to give you a better idea (of)
where the kids are ... You don't have the time or the expertise in
the classroom, generally, to do that; the computer does it in one
fell swoop." This quick and accurate scoring service, covering all
the various kinds of tests used, was available to any teacher in the
district. Over the years, further, the link from CAI to test scoring
and analysis led to a further computer application. That is, teachers
had gradually developed large banks of educational objectives, had
written or adapted hundreds of test items written at varying levels
of difficulty, and could resort to the computer files to call out a
particular kind of test for a particular instructional purpose. Over
the years it appears that local teacher involvement, with technical
assistance and leadership from the AEA and district officials, has
led to a greater degree of test sophistication and test use among
teachers than was the case in district one and two schools.

Therefore, while some teachers expressed concerns about such
problems as the lateness of receiving the results of the standardized test
as well as their relevance for some classroom objectives, these
criticisms did not carry over to testing in general. Indeed, some of
the tests used were seen as invaluable for both teachers and students.
Tests seemed also to be used as instructional motivators whose results were
discussed by teacher and students as one more source of diagnostic
information. The link between testing policy and test use seemed
clearer than in the first two districts. In the third district teachers
seemed to feel the testing program was in part their own, to be used
for their level 1 and 2 classroom decisions as well as for school and
district accountability matters, and to be tempered by teachers'
professional interactions with their students.

The second school visited, also an elementary school, appeared
similar to the first in terms of uses of assessment information. The
norm-referenced test used, the ITBS, did not appear to receive
a great deal of emphasis for classroom decisions, although it was useful
to the administration in making decisions about building-level
effectiveness.

District developed and validated tests did appear to be weighed
heavily for certain kinds of within-class decisions as well as for
teacher self-monitoring. For many of these decisions, further, teachers
also relied on less formal means of assessment in the interests of
making the best instructional decisions.

The third school visited was a high school. Here some of
the school staff interviewed seemed knowledgeable (in some cases,
almost expert) in matters of testing and test use, especially in
the math department. Indeed, the school administration expressed
hope that the model of the math department would eventually transfer
to other departments. To be effective, however, they believed that
this must occur naturally with no direct interference from the
administration.

In this school, the principal and associate principal emphasized
the crucial role of the district in sponsoring within-school and
centralized opportunities for technical assistance in testing. This school
also seemed to exemplify the best uses of certain kinds of tests. In
terms of the ITED, its use, as seen by the school administration, was
expressed as follows: "We need at least one outside measure,
something outside of our own control ... so we can just have a benchmark ...
that we can compare with", in terms of school-level performance. Beyond
that, item analysis of ITED scores might lead to discussion between
the associate principal and a department chair if test score trends,
over time, were consistently poor in certain areas. "Should this
indication lead to course modification? Adding something to instruction?
Do instructors want to add this area to instruction? Do they want to
leave it out because they don't think it's important?" This kind of
discussion indicated a measure of department autonomy or, at least,
negotiated decision-making.







In this school in general, and in the math department in particular,
the school-developed measures appeared to be accepted and used by
teachers. Departmental autonomy in testing and the inservice and
technical assistance made available appeared to have stimulated local
development of tests that are quickly accessible, fit teachers' practical
needs, and have high content and classroom relevance. Standardized
tests were primarily used by the school administration, and seemed to
be viewed neither as a threat nor as an unnecessary burden by the
teachers.

District Summary

This district clearly had a different approach to testing and
testing policy than the first two. It appeared that the district establishes
broad policy for schools, and the schools, in turn, set broad policy
for the instructional teams in the elementary schools and the
departments in the high schools. Test administration, quality, and level
1 and 2 uses were also focused at the level of team or department.
In addition, both the district central office and staff of the AEA
provided active leadership in the development of tests and their
instructional uses. Policy was clear, though flexible; it seemed to reflect
an organizational system whose units could "couple" or "de-couple," as
described in Bank and Williams (1981). A great deal of the testing appeared
to be "owned" by the school unit of concern, team or department. While
teachers seemed less likely to rely greatly on the ITBS and the ITED,
counselors were available to help interpret these scores and place them
in the larger assessment context for individual teachers.
Teacher knowledge of tests and testing appeared to be greater than
in the first two districts. There also appeared to be more inservice
and there was certainly much more technical assistance available in
the third district. This has led to the development of tests of
higher quality which apparently have marked instructional relevance
for the teachers. The testing situation appears to come close to the
teacher's ideal (as we described it on page 16). That is, the overall
testing program:

   o  offers tests oriented to classroom teachers

   o  permits teachers to use tests so as to meet their practical
      activities and exigencies

   o  does not force teachers to emphasize tests which do not fit their
      practical demands

   o  permits teachers to administer/use a variety of tests

   o  is sensitive to the exigencies of teaching-as-practiced.

In this district, further, the merits of different kinds of measures
were not discussed by the participants in an adversarial setting.
Instead, the teachers, principals, and district officials seemed to
accept the need for and value in generating information that will paint
the big (norm-referenced) picture, that will provide a wide-angle view
about groups and programs. They did not over-emphasize this picture.
They also accepted the need to generate information about the individual
students and classrooms (criterion-referenced or objectives-based)
that together make up the big picture. They did not over-emphasize the
value of this picture either.

They seemed to be using the right kind of test to get the larger
aggregate picture, and a series of other, equally appropriate measures
to get a variety of snapshots with a closer focus and with greater
detail of the separate parts of the picture. The district, the
central figure, has supplied the camera, the means to get the different
pictures, and takes the kind of shot with the degree of resolution
it needs. The schools and classrooms use the same camera, but they
select a kind of film that meets their needs, and then choose an angle,
focus, and degree of resolution sensitive enough to get the series
of shots that they need. The end result seemed to be a montage reflecting
different degrees of instructional progress among different aggregates
of students at varying points in time.

As with other activities stemming from our test use planning
work, information collected and analyzed seemed to clarify the most
critical areas to pursue in our national survey, as well as the manner
in which to pursue these areas.

The next section of the report discusses the manner in which the
national sample was selected and presents the results of questionnaire
data collection and analyses.

THE NATIONAL SURVEY 

Sampling Methodology

As mentioned previously, we intended for the survey to be national
in scope, to provide both descriptive and inferential data relating to
a series of practical and policy matters, and were guided by our planning
work as we conceived of the design for the survey, drafted questionnaires,
and considered the sampling plan. The sample had to be selected so as to
obtain a national picture of the uses of achievement testing, and we had
limited resources to do this. Teachers were the primary target of the
survey because they conduct a great deal of achievement testing and are
therefore in one of the most strategic positions from which to judge the
relevance of testing programs in terms of criteria we have alluded to
throughout this report. In addition, to collect confirmatory data and
information on relevant contextual variables, principals of selected
schools and district testing officers were also selected as study
respondents.

The Test Use Project's earlier fieldwork had demonstrated that
frequency and uses of tests vary with grade level of students. The survey
therefore included the fourth, sixth, and tenth grade levels. (Rationale
for the selection of these grade levels has been provided in earlier Test
Use Project reports.)

Because the focus of much testing is in the basic skills areas, the
study targeted assessment in reading, language arts, and mathematics.
Elementary school teachers were asked questions pertaining to both reading/
language arts and mathematics assessment. At the tenth grade level,
language arts (English) and mathematics teachers were asked about assessment
in their respective fields.

The survey was directed to two elementary and two secondary schools in each
selected school district, with two fourth and two sixth grade teachers in each
selected elementary school, and two language arts and two mathematics teachers
in each selected secondary school. The target was about 400 teachers of each
type. Since many districts have only one secondary school, it was necessary
to sample in excess of 100 districts to meet this objective.
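
The arithmetic behind the 100-district figure can be checked with a short
sketch. The function names are illustrative only, and the share of districts
with a single secondary school (0.25) is a hypothetical value chosen for the
example, not a figure reported by the project.

```python
import math

def districts_needed(target_teachers, schools_per_district, teachers_per_school):
    """Smallest number of districts that yields the target count of one teacher type."""
    per_district = schools_per_district * teachers_per_school
    return math.ceil(target_teachers / per_district)

# Figures from the text: about 400 teachers of each type, two schools per
# level in each district, two teachers of each type in each school.
full_yield = districts_needed(400, 2, 2)
print(full_yield)  # 100 when every district can supply two schools per level

def districts_needed_mixed(target_teachers, share_single_school, teachers_per_school=2):
    """Same count when some districts can contribute only one school.

    share_single_school is a hypothetical input used for illustration.
    """
    expected_schools = 2 - share_single_school  # (1 - f) * 2 + f * 1
    return math.ceil(target_teachers / (teachers_per_school * expected_schools))

print(districts_needed_mixed(400, 0.25))  # 115 under the assumed 25 percent share
```

Any positive share of one-school districts pushes the requirement above 100,
which is the point the paragraph makes.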

The sample was selected to be sufficiently representative of the target
populations as to generalize to these populations throughout public schools.
Factors that guided the selection included the district's minimum competency
testing status, student enrollment, SES, geographic region, and metropolitan
status.

Because the data collected in the study are being used to provide the 
basis for inferences about the influences of various contextual factors, 
the project was careful to design a sampling plan that would obtain 
general representation over the variables of interest. The conception, 
development, and refinement of the sampling plan proceeded as follows. 
The Initial Plan

The initial conception of the plan was to draw a sample of approximately
100 districts to yield a total respondent sample of 2,100 individuals.
Allowing for the inevitable shrinkage which occurs with the use of mailed
questionnaires, this size of sample was considered to be an adequate basis
for inferences about the nature of achievement tests in current classroom
use. An illustration of kind of respondent by school district is seen in
Table 5.



                                Table 5

            Number of Respondents by Type for each District

Respondent Classification               Number for each District

District Testing Officer                 1
Elementary School Principals             2
High School Principals                   2
Fourth Grade Teachers                    4  (2 in each of 2 schools)
Sixth Grade Teachers                     4  (2 in each of 2 schools)
Tenth Grade Teachers, language arts      4  (2 in each of 2 schools)
Tenth Grade Teachers, mathematics        4  (2 in each of 2 schools)

TOTAL                                   21



The initial design called for a proportional probability sampling
(PPS) strategy to draw a sample of districts and schools with a probability
of being selected proportional to their size and representation in the
population by state testing criterion (i.e., state assessment and minimum
competency testing status).

During this stage of our thinking, we considered the factors that
might be used to stratify the population of districts for sampling
purposes; e.g., presence of minimum competency testing, size and location
of district, etc. In that some strata would have no comparative
usefulness in the study, our interest in them was limited to ensuring that
interesting population features would be present in the sample in
proportion to their representation in that population. We felt, at
this time, that the most direct manner of obtaining a sample of
districts was to array them in a nested ordering representing the specific
characteristics of interest, and then to sample them with a probability
proportional to their sizes.
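
One common way to sample an ordered list with probability proportional to
size is systematic PPS selection: cumulate the sizes, pick a random start,
and step through the list at a fixed interval. The sketch below assumes that
procedure; the report does not say which PPS mechanism was intended, and the
enrollment figures are hypothetical.

```python
import random
from bisect import bisect_right
from itertools import accumulate

def pps_systematic(sizes, n, rng):
    """Return indices of n units drawn with probability proportional to size.

    The units are assumed to be already arranged in the nested order the
    text describes, so the fixed interval spreads the sample across that
    ordering.
    """
    cumulative = list(accumulate(sizes))
    interval = cumulative[-1] / n
    start = rng.random() * interval
    picks = []
    for k in range(n):
        point = start + k * interval
        index = bisect_right(cumulative, point)
        picks.append(min(index, len(sizes) - 1))  # guard against float rounding
    return picks

# Hypothetical enrollments for eight districts, listed in frame order.
enrollments = [1200, 800, 15000, 3000, 52000, 700, 9000, 2500]
sample = pps_systematic(enrollments, 3, random.Random(1981))
print(sample)
```

A district whose enrollment exceeds the sampling interval (the fifth one
here) is certain to be hit and can be hit more than once; a practical design
would set such units aside as certainty selections.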

The major features in this initial conception of the sampling design
were the minimum competency testing matrix and geographical region.
Districts were to be ordered in the cells of the MCT matrix by
geographical region. The districts would not need to be ordered by size
because the PPS scheme would select them in a fashion properly representing
this variable.

Within districts, schools were to be selected by randomly drawing
one low SES and one middle or high SES elementary school from a list
of such schools and by similarly drawing the two secondary schools.
At the elementary school level, lower SES schools were defined as
those receiving ESEA Title I funds, and higher SES as those receiving
no compensatory education funding. Aid to Families with Dependent
Children (AFDC) was used to define SES at the high school.

Because a great many of the districts in the United States are
too small to have two elementary schools or (especially) two high
schools, districts would be selected with probability proportional
to size, so as to reduce the likelihood of drawing a very small district.

The teachers would be selected by drawing two teachers at random
from a list of target teachers at each desired grade level. At the
elementary school level, target teachers were defined by grade level
taught. At the secondary level, target teachers were defined as those
teaching the subject area classes with greatest enrollment at their
school; i.e., the most common mathematics and language arts classes
for tenth graders. If time pressures could not be controlled, an
algorithm for making such a selection would be described to the principals
who would then make the selection.

The samples obtained by this method would represent the responses
of the "typical" teacher or principal, in that larger places (employing
more of these people) would be more likely to appear in the sample than
would smaller places, while the probability of including specific districts
would be inversely related to their size. The net result would be that
all teachers would have about an equal chance to be selected into the study;
the same would be true of principals.

Respondents in district offices, however, would not have equal
probability of selection, and if their responses were to be analyzed
without weighting for selection probabilities, they would represent
responses characterizing the environment of the "typical student" (as
if we had selected the district officers with equal probability and
weighted for district size). To obtain a characterization of the
"typical district," it would be necessary to weight the responses
by the selection probability (found by taking the ratio of the district's
enrollment used in the selection procedure to the total enrollment
over all districts).
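
The difference between the "typical student" and "typical district" views
can be illustrated numerically. The sketch below reads "weight by the
selection probability" as weighting each response by the reciprocal of that
probability, which is the usual survey convention; that reading is an
assumption, and the enrollments and responses are hypothetical.

```python
def selection_probabilities(sampled_enrollments, total_enrollment):
    """Per-draw PPS selection probability: district enrollment / total enrollment."""
    return [e / total_enrollment for e in sampled_enrollments]

def weighted_mean(responses, weights):
    """Weighted average of the responses."""
    return sum(w * r for w, r in zip(weights, responses)) / sum(weights)

# Hypothetical data: three sampled districts and a yes/no item coded 1/0.
enrollments = [1000, 4000, 20000]
responses = [1, 0, 0]
probs = selection_probabilities(enrollments, total_enrollment=100000)

# Unweighted analysis of a PPS sample describes the "typical student".
typical_student = weighted_mean(responses, [1.0] * len(responses))
# Inverse-probability weights restore the "typical district" view.
typical_district = weighted_mean(responses, [1.0 / p for p in probs])
print(round(typical_student, 3), round(typical_district, 3))  # 0.333 0.769
```

The small district counts for far more once the weights are applied, because
it stands in for the many small districts that PPS selection rarely draws.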

Our thinking at this stage was that the most desirable analyses
would be those involving no weighting (except as may have been needed
when dealing with very small districts), and the sampling would reflect
concern for representation of the primary target population, teachers.



Revised Sampling Design 

Toward the end of 1980, the initial sampling plan underwent a
process of internal and external review. During the review process,
it became clear that while the plan could be improved, certain features
of the initial plan should be retained. For example, teachers remained
as the primary focus of interest; MCT and geographical region were
still used to define districts; low and high SES definitions of schools
were as previously described in this section; the number of respondents
of each type by district would remain constant (as seen in Table 5).

However, the review process revealed that while the PPS-based
design would be adequate for the study's descriptive purposes, its
capacity for allowing analytic, policy-relevant comparisons was limited.
A series of project meetings led to the decision to replace the initial
sampling procedure by a probability lattice methodology. The sample so
produced, with minimal weightings, meets both the descriptive needs of
the study (to provide a nationwide picture of assessment practices
and uses) and its analytic needs (comparisons by SES, MCT, within
districts, etc.).

As was the case with the original design, in the revised plan
1,600 of the target population consisted of elementary and secondary
teachers; approximately 400 were school principals, and another 120
were district testing officers.

The sampling was conducted in three stages:

(1) selection of 120 districts from a highly stratified sampling
    frame

(2) selection of two elementary and two secondary schools
    (size permitting) from each district

(3) selection of four teachers from each selected school

The selection at each stage was devised so that collectively the three
stages produced a sample of teachers that was approximately self-
weighting; that is, the overall selection probabilities for teachers
were approximately equal.
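
A self-weighting design can be checked by multiplying the stage
probabilities. The sketch below is a simplified illustration, not the
project's procedure: it treats the first stage as plain PPS on enrollment
rather than probability lattice sampling, and it assumes a hypothetical
universe in which every school has the same number of target teachers.

```python
from fractions import Fraction

def teacher_selection_probability(districts_sampled, district_enrollment,
                                  total_enrollment, schools_in_district,
                                  schools_sampled, teachers_in_school,
                                  teachers_sampled):
    """Overall probability as the product of the three stage probabilities."""
    p_district = Fraction(districts_sampled * district_enrollment, total_enrollment)
    p_school = Fraction(schools_sampled, schools_in_district)
    p_teacher = Fraction(teachers_sampled, teachers_in_school)
    return p_district * p_school * p_teacher

# Hypothetical districts: the second is four times the size of the first
# and has four times as many schools.
small = teacher_selection_probability(120, 10_000, 40_000_000, 4, 2, 10, 4)
large = teacher_selection_probability(120, 40_000, 40_000_000, 16, 2, 10, 4)
print(small, large)  # 3/500 3/500
```

The larger district is four times as likely to be drawn, but each of its
schools is one quarter as likely to be chosen once it is, so the two
effects cancel and teachers in both districts face the same overall chance.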

First Stage (Selection of Districts)

Our sample called for a relatively small first-stage sample from
a highly stratified sampling frame. With conventional stratification,
the number of strata cannot exceed sample size, thus precluding its
usefulness for our purposes. Although a number of stratification
schemes introduced over the past thirty years do not have this
limitation, most require symmetrical joint distributions over the
stratification factors, which generally are not present in naturally
occurring populations. Jessen (1970) has presented several schemes
that do not require such symmetry under the collective label
"probability lattice sampling."

With probability lattice sampling (PLS) (Jessen, 1970), we were
able to obtain a sample that was similar to a Latin square experimental
design. The sampling universe was stratified into several levels
for each of several factors, and the sample that was selected
simultaneously represented each level of each factor in predesignated
proportions. This result is obtained with probabilities proportional
to size even though the cells formed by the multiple stratification
have different measures-of-size (MOS). Indeed, most of the cells
in our sampling frame had zero MOS (i.e., were empty).



Public school districts were the sampling units for the first
stage. The sampling population excluded Alaska and Hawaii, as well
as districts that are not unified. The data source was District File,
a listing of all U.S. public school districts by Market Data Retrieval
(MDR), 1980. The school districts in the sampling population numbered
13,815, with a combined reported student enrollment of 41,589,605.
Stratification 

Five stratification variables were chosen to enhance the analytic
and descriptive qualities of the sample:

(1) minimum competency testing status

(2) size of student enrollment

(3) SES of attendance area

(4) geographic region

(5) metropolitan status

Minimum Competency Testing Status. Districts were categorized
according to the status of minimum competency testing (MCT) in their
respective states. This categorization, based on Gorth (1980) and
Kaufman (1979), reflects whether an MCT program exists, whether MCT
is a requirement for promotion or graduation, and whether the state
allows local districts the option of designing or selecting the tests
to be used. Thus, there were five strata:

                                         # districts   % total enrollment

(1) MCT not required for graduation          2703              19
    or promotion; no local option.

(2) MCT not required for graduation or       2065              13
    promotion; local options.

(3) MCT required for graduation or            980              18
    promotion; no local option.

(4) MCT required for graduation or           1778              16
    promotion; local options.

(5) No MCT program mandated for              6289              34
    implementation by 1981 at
    state level.
                                            -----             ---
                                            13815             100



Size of student enrollment. The enrollment strata were designed to
assure representation from very small, small, medium, large, and very large
districts. In setting the class limits, special attention was paid to
previous CSE research that found that the organization of district
administration and use of resources in testing is significantly different for
districts that are above certain size thresholds (Lyon, 1978). The five
strata are:

                              # districts   % total enrollment

(1)        - 4,999               12061              37

(2)  5,000 - 9,999                1059              18

(3) 10,000 - 24,999                514              18

(4) 25,000 - 44,999                105               8

(5) 45,000 -                        76              19

Of these five strata, numbers (2) - (5) are identical with those used in the
Lyon study just cited. An additional, smaller stratum (1) was added here to
assure representation of smaller districts.



SES of attendance area. The MDR data file indexes school districts
into four categories based upon calculations of the Orshansky Index. These
categories were collapsed into three strata:

                              # districts   % total enrollment

(1)   .1 - 4.9% (wealthiest)      1907              16

(2)  5.0 - 24.9%                  9051              69

(3) 25.0 -      (poorest)         2857              15
                                 -----             ---
                                 13815             100



Geographic region. Four strata were defined in order to assure
representation across the continental United States:

(1) Northeast -- Connecticut, Delaware, District of Columbia,
    Maine, Maryland, Massachusetts, New Hampshire, New Jersey, New
    York, Pennsylvania, Rhode Island, Vermont.

    2718 districts          25% of total enrollment

(2) Southeast -- Alabama, Arkansas, Florida, Georgia, Kentucky,
    Louisiana, Mississippi, North Carolina, South Carolina,
    Tennessee, Virginia, West Virginia.

    1736 districts          24% of total enrollment

(3) Middle -- Illinois, Indiana, Iowa, Kansas, Michigan, Minnesota,
    Missouri, Nebraska, North Dakota, Ohio, South Dakota, Wisconsin.

    5279 districts          27% of total enrollment

(4) West -- Arizona, California, Colorado, Idaho, Montana, Nevada,
    New Mexico, Oklahoma, Oregon, Texas, Utah, Washington, Wyoming.

    4092 districts          25% of total enrollment

(These divisions are identical to those used by the National Education
Association in its annual survey of teachers' activities and opinions.)















Metropolitan status. The MDR data file groups school districts into three levels of metropolitan status. These groupings were adopted as strata to reflect different degrees of urbanness.

                          # districts    % total enrollment

(1) Central City               915               31

(2) Urban Fringe              3354               32

(3) Non-metropolitan          9546               37

    Total                    13815              100


Sampling Frame 

From the five stratification variables, a 900-cell matrix was fashioned that has 75 rows and 12 columns.

[Figure: sampling-frame matrix. Columns: (region) x (metro status), 12 columns. Rows: (MCT) x (size) x (SES), 75 rows.]





























































































































Upon allocating the 13,815 districts in the sampling population among the cells of this matrix, nine full rows were found to be unoccupied, as were an additional 436 cells. The occupied cells, then, numbered [number illegible]. Sampling from this matrix in such a manner that each occupied row and column (but not each cell) is represented had the effect of crossing the two column factors in factorial fashion, and similarly crossing the three row factors in "near" factorial fashion; the exception, of course, is that nine combinations of the row factors did not exist in the population of districts. The methods used to achieve this sampling are described below.
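
The frame dimensions and the number of occupied cells can be checked with a short calculation. The sketch below uses the strata counts given above (five MCT, five size, three SES, four region, and three metropolitan levels). It also assumes, which the text does not state, that the 436 additional empty cells all lie outside the nine empty rows.

```python
# Number of levels for each stratification variable, as described in the text.
MCT_LEVELS, SIZE_LEVELS, SES_LEVELS = 5, 5, 3
REGION_LEVELS, METRO_LEVELS = 4, 3

rows = MCT_LEVELS * SIZE_LEVELS * SES_LEVELS   # (MCT) x (size) x (SES)
cols = REGION_LEVELS * METRO_LEVELS            # (region) x (metro status)
total_cells = rows * cols

EMPTY_ROWS = 9                 # full rows with no districts
ADDITIONAL_EMPTY_CELLS = 436   # other unoccupied cells

# Assumption: the additional empty cells do not overlap the empty rows.
occupied_cells = total_cells - EMPTY_ROWS * cols - ADDITIONAL_EMPTY_CELLS

print(rows, cols, total_cells, occupied_cells)  # 75 12 900 356
```

Under that assumption the occupied cells would number 356; if some of the 436 cells were counted inside the empty rows, the figure would be larger.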

Selection Procedure

The sample size of districts was set at 120 rather than 100 as envisioned in the original sampling plan. The larger sample size partly offset the fact that many of the smaller districts have only one high school, and some may have just one elementary school.

The selection probabilities for districts were set proportional to a measure of size (MOS), namely, student enrollments reported in the MDR data file. With properly coordinated selection probabilities at the successive two stages, our sampling of teachers theoretically could have been self-weighting (i.e., equal probability) without the inconvenience of a highly variable sample size for teachers. However, as amplified later in this section, analytic and cost considerations led us to modify the procedure so that the self-weighting of the sample was approximate, rather than exact.






Weighting. To enhance the analytic characteristics of the sample we undersampled from two strata:

(1) MCT Stratum #5: no minimum competency testing program at state level.

(2) Size Stratum #1: districts with enrollment less than 5,000.

Undersampling from MCT Stratum #5 permitted the selection from the other strata of correspondingly more districts that have greater analytic interest, i.e., those with some MCT program in force or to be implemented by 1981. A target of approximately 20 districts in MCT Stratum #5 was accordingly set, and the weight for this stratum was set at 0.6. A target of approximately 25 districts was set for Size Stratum #1, in order to avoid over-burdening the sample with small districts, which in some respects are of less interest to this study because they draw fewer federal and state dollars and allocate fewer local resources to testing (Lyon, et al, 1979). Accordingly, the weight for this stratum was set at 0.7. In order to accommodate these two weightings into the sampling frame, weights were required for other cells as well: cells that were jointly in MCT Stratum #5 and Size Stratum #1 were weighted by 0.42; cells that were part of neither of these strata were weighted by [value illegible].

The implication of the above weighting scheme was to specify a sample of districts that was distributed across the various strata as depicted in Table 6. Table 6 also includes the actual sample allocation that resulted.
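
The joint weight of 0.42 is the product of the two stratum weights (0.6 x 0.7). A minimal sketch of how such cell weights combine is given below; the function name is hypothetical, and the baseline weight of 1.0 for cells in neither undersampled stratum is an assumption, since that value is not legible in the text.

```python
MCT_STRATUM_5_WEIGHT = 0.6   # no state-level MCT program
SIZE_STRATUM_1_WEIGHT = 0.7  # enrollment below 5,000

def cell_weight(mct_stratum: int, size_stratum: int) -> float:
    """Relative sampling weight of a frame cell (hypothetical helper)."""
    weight = 1.0  # assumed baseline for cells in neither undersampled stratum
    if mct_stratum == 5:
        weight *= MCT_STRATUM_5_WEIGHT
    if size_stratum == 1:
        weight *= SIZE_STRATUM_1_WEIGHT
    return weight

# A cell in both undersampled strata carries the product of the two weights.
print(round(cell_weight(5, 1), 2))  # 0.42
```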



Cell selection. The first-stage selection of school districts was actually accomplished in two sub-stages: 120 cell selections from the sampling frame, then the selection of one district for each cell selection.

                                 Table 6

                Allocation of District Sample Among Strata

                               Expected       Actual Target    Responding
Variable            Level      Allocation*    Sample           Districts**

MCT Status            1          27.1             27               22
                      2          18.2             18               17
                      3          28.5             28               21
                      4          25.4             26               16
                      5          20.9             21               15

Enrollment            1          25.5             25               19
                      2          25.7             26               22
                      3          26.7             27               22
                      4          12.3             12                9
                      5          29.8             30               19

SES                   1          17.[?]           17               15
                      2          85.4             86               61
                      3          16.9             17               15

Region                1          31.4             31               22
                      2          33.0             33               28
                      3          27.5             27               22
                      4          28.2             29               19

Metropolitan          1          47.4             47           [illegible]
Status                2          35.9             36               27
                      3          36.7             37               31

*The fractional portion of the allocations should be interpreted as the probabilities with which an additional district should be selected from the respective stratum. For example, 27 districts are to be selected from MCT Stratum #1, with a ten percent chance of selecting a 28th district.

**Corrected weights corresponding to the figures in this column will be incorporated and used throughout the analyses of the final report. Preliminary results reported in this document were computed using equal weights.
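
The footnote to Table 6 treats the fractional part of an expected allocation as the probability of selecting one additional district. A minimal sketch of that interpretation follows; the function name is hypothetical, and the code illustrates the rounding rule only, not the lattice procedure by which the actual target sample was drawn.

```python
import math
import random

def realize_allocation(expected: float, rng: random.Random) -> int:
    """Turn an expected allocation such as 27.1 into a whole number of districts.

    The integer part is always taken; one more district is added with
    probability equal to the fractional part.
    """
    base = math.floor(expected)
    fraction = expected - base
    return base + (1 if rng.random() < fraction else 0)

rng = random.Random(1981)  # arbitrary seed, for repeatability only
draws = [realize_allocation(27.1, rng) for _ in range(20000)]
# Each draw is 27 or 28, and the long-run average approaches 27.1.
print(set(draws), sum(draws) / len(draws))
```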

The previously described PLS that we used required us to arbitrarily designate a "feasible set" of lattices for the frame, each of which satisfies the pre-designated quotas for each stratifying factor. For example, where the stratification quota is three, each feasible lattice designation is required to include three non-zero cells for that stratum. Each lattice was assigned a selection probability. The number of lattices in the feasible set and their selection probabilities are not jointly arbitrary, but are determined by a set of decision rules that guarantee that the sum of probabilities for all lattices that include a particular cell is proportional to the MOS for that cell. Finally, observing the assigned selection probabilities, we selected one lattice from the feasible set to obtain the sample of cells.

Since some of the larger cells were designated more than once (in accordance with cell size), the total number of distinct cells in our selection lattice was only 98.

Selection of districts within cells. School districts were sampled from selected cells with probabilities proportional to MOS (again, district enrollment). The procedure was to list the districts within cells in alphabetic order, cumulate the MOS, then select a random number between one (1.0) and the total MOS for that cell. By matching the random number to the cumulative MOS, the sampled district was identified. This process was repeated in cells selected more than once.
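
The cumulative-MOS procedure described above can be sketched as follows. This is an illustration under stated assumptions rather than the project's actual program: the function names, the district names, and the enrollments are all hypothetical, and enrollments are assumed to be positive integers.

```python
import bisect
import itertools
import random

def pick_by_cumulative_mos(districts, draw):
    """Return the district whose cumulative-MOS interval contains `draw`.

    districts: list of (name, enrollment) pairs with positive integer enrollments.
    draw: integer between 1 and the total enrollment of the cell, inclusive.
    """
    ordered = sorted(districts)  # alphabetic order by district name
    cumulative = list(itertools.accumulate(mos for _, mos in ordered))
    if not 1 <= draw <= cumulative[-1]:
        raise ValueError("draw must lie between 1 and the total MOS")
    # First position whose cumulative MOS is at least the drawn number.
    index = bisect.bisect_left(cumulative, draw)
    return ordered[index][0]

def select_district(districts, rng=random):
    """Select one district with probability proportional to enrollment."""
    total_mos = sum(mos for _, mos in districts)
    return pick_by_cumulative_mos(districts, rng.randint(1, total_mos))

# Hypothetical cell containing three made-up districts.
cell = [("Cedar", 1000), ("Alder", 3000), ("Birch", 6000)]
print(select_district(cell, random.Random(1981)))
```

In this made-up cell, draws of 1 to 3,000 select Alder, 3,001 to 9,000 select Birch, and 9,001 to 10,000 select Cedar, so each district's chance of selection equals its share of the cell's enrollment.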



Anticipating that some districts would refuse to participate in the study, a nonresponse strategy was developed. At the district level, non-cooperating districts were replaced from the same cells from which the refusals came.

Second Stage (Selection of Schools)

As noted earlier, the sample design called for selection with probabilities proportional to MOS of two elementary and two secondary schools from each selected district. (Many of the smaller districts, of course, yielded only one secondary school.) The procedure for this selection was as follows.

Before the initial district contact, a list of schools and their enrollments was obtained from data files prepared by the Office of Civil Rights and the National Center for Education Statistics. If the district was large, a pre-selection of eight elementary and four secondary schools was made using systematic sampling to select with probabilities proportional to size.

A protocol was then designed to structure initial telephone contacts with officials of selected districts. During the course of the district contact, the list of pre-selected schools was read to a district official, who was asked to rank them according to percent AFDC, percent receiving free lunch, or another locally salient poverty status variable. A cumulative list of the MOS for the ranked schools was calculated, and systematic sampling was used to make the selections with probabilities proportional to MOS. In the case of large districts where a pre-selection was made as described above, actual selection of the four schools to be included in the sample was made with equal probabilities since a MOS selection had already been made. This selection process had the effect of stratifying within the district on the basis of poverty status. Where a school declined to cooperate (or the district refused to permit a sampled school to participate), another school in the appropriate poverty stratum was chosen from the pre-selected subset.
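
Systematic sampling from a ranked, cumulated list can be sketched as below. The sketch assumes the common textbook form of the method (a random start followed by equally spaced selection points along the cumulative MOS); the report does not give the exact rules that were used, and the function name, school names, and enrollments are hypothetical.

```python
import random

def systematic_pps(units, n, u):
    """Select n units with probability proportional to size.

    units: (name, mos) pairs, already ranked (e.g., by poverty status).
    n: number of selections wanted.
    u: random start expressed as a fraction of the interval, 0 <= u < 1.
    """
    total = sum(mos for _, mos in units)
    interval = total / n
    # Equally spaced selection points along the cumulative MOS scale.
    points = [(u + k) * interval for k in range(n)]
    selected = []
    cumulative = 0.0
    next_point = 0
    for name, mos in units:
        cumulative += mos
        # A unit is taken once for every selection point inside its interval.
        while next_point < n and points[next_point] < cumulative:
            selected.append(name)
            next_point += 1
    return selected

# Hypothetical schools, ranked from lowest to highest poverty.
schools = [("A", 100), ("B", 300), ("C", 200), ("D", 400)]
print(systematic_pps(schools, 2, random.Random(7).random()))
```

Because the selection points are spread evenly down the ranked list, the chosen schools tend to come from different parts of the poverty ranking, which is the stratifying effect the text describes. A school whose MOS exceeds the interval can be hit more than once.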

Third Stage (Selection of Teachers)

The four teacher types (fourth grade, sixth grade, tenth grade language arts, tenth grade mathematics) were treated as separate populations. As noted earlier, a sample of two teachers was targeted from each type.

In order to complete the self-weighting nature of the survey sample, it would have been necessary to collect lists of the four teacher types for each school, or at least the numbers thereof. Selection probabilities would then have been calculated as functions of the MOS used in the second-stage selection of the respective schools:

\[ \frac{2(K)}{(\text{MOS for school})} \]

where K, which is equal to 12 times the national average (or typical) pupil/teacher ratio, accounts for the fact that MOS in the first stages was based on student enrollments rather than numbers of teachers. Thus, the overall selection probabilities would have been constant except for the effect of the stratum weights applied in the first stage sampling:

\[ \frac{100(W_i)(\text{district MOS})}{\sum(\text{district MOS})} \times \frac{2(\text{school MOS})}{(\text{district MOS})} \times \frac{2(K)}{(\text{school MOS})} = \frac{400(W_i)(K)}{\sum(\text{district MOS})} \]

where W_i is the stratum weight.
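
The way the district and school MOS cancel out of the three-stage product can be checked numerically. The sketch below uses exact fractions; the function name is hypothetical, and the values chosen for K, the total MOS, and the individual enrollments are made up purely for illustration.

```python
from fractions import Fraction

def overall_probability(w, k, total_mos, district_mos, school_mos):
    """Product of the three stage probabilities, computed exactly."""
    first = Fraction(100) * w * district_mos / total_mos  # district stage
    second = Fraction(2 * school_mos, district_mos)       # school stage
    third = Fraction(2 * k, school_mos)                   # teacher stage
    return first * second * third

W = Fraction(3, 5)        # a stratum weight of 0.6
K = 240                   # hypothetical value of K
TOTAL_MOS = 40_000_000    # hypothetical sum of district MOS

# Two very different districts and schools give the same overall value.
p_large = overall_probability(W, K, TOTAL_MOS, 50_000, 800)
p_small = overall_probability(W, K, TOTAL_MOS, 2_500, 300)
print(p_large == p_small == Fraction(400) * W * K / TOTAL_MOS)  # True
```

The result depends only on the stratum weight, K, and the total MOS, which is the sense in which the design would have been self-weighting within a stratum.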



The above procedure would have provided the expectation of two teachers of each type from each school with just four distinct values for W_i. However, that procedure had two significant drawbacks: (1) it would have required additional phone contacts before sampling could be completed; and (2) the variation of sample size from each school could have allowed one or more respondents in a given category to be selected.

Because of these problems, we devised a procedure, based on previous CSE survey research, in which the respective principals were provided with a systematic sampling protocol enabling them to select two teachers at random from the appropriate categories. The variability in third-stage selection probabilities has slight if any effect, and the cost and time savings, as well as analytical advantages, are significant.

Principals were provided with extra questionnaires and directions for selecting, where possible, alternate respondents in the case of teacher non-response.

Questionnaire development. Questionnaire development drew upon the theory of test use and the conceptual scheme previously described. Development efforts were informed by the experiences of and information emanating from the various project planning activities. These sources enabled us to draw up specifications to describe content areas that the questionnaire items would tap. From these specifications, item sets were constructed for the teachers' and principals' questionnaires.

Draft questionnaires were reviewed by a variety of experts and practitioners as described in our interim report to the NIE, January 1981. In addition, questionnaires were piloted with principals and teachers (N=50) in three phases in both a large, urban school district and in a small, suburban district in the Southern California area. Each of the three pilot phases, in conjunction with expert reviews, provided information that was used to make successive revisions of each of the questionnaires. Teachers and principals participating in the field test were selectively interviewed about the instrument; all completed summary and item-by-item questionnaire review forms.

As a result of these procedures, all of the draft questionnaires were extensively modified, so as to focus more effectively on the information needed by the project while minimizing the response burden for individual respondents. The questionnaires were formatted so as to facilitate coding and data processing.

The teacher survey was constructed in two forms, elementary and secondary, to accommodate the obvious differences in class structure. At the elementary level, the questionnaire asked teachers about testing in both mathematics and reading. At the secondary level, the questionnaire asked mathematics and English/reading teachers about testing activities and test use only in their subject field. Both teacher questionnaires contain a common core of questions about perceptions, instructional organization, training and experience, leadership, and other contextual information.

The principals' questionnaire is not differentiated by school level; it does differ, however, from the teacher questionnaire in the types of tests and the uses to which it refers, so as to further reduce the response burden for teachers and to provide a means of confirming their content statements. Copies of the questionnaires in their final form are appended to this report.






Contacts with districts and schools. Following the revised sampling design, 120 school districts were selected during the first stage. (One hundred fourteen school districts were actually selected. Since the probability of a given district's selection was proportional to a measure of size, several large districts were selected more than once. In these districts, the usual set of schools to be selected per district, two elementary schools and two high schools, was multiplied by the number of times the district was selected.)

Initially each sampled district was contacted by phone, and once the appropriate member of the district staff was identified, three matters were dealt with:

(a) The agreement of the district to participate was obtained. In many cases this was not granted until district personnel had had a chance to review the overall objectives of the project and the actual questionnaires to be used. This introduced some delay into the procedure, but the view was taken that the district was entitled to receive the fullest information about the project that the team could provide.

(b) The assistance of the district in selecting schools in accordance with the sampling frame was sought. Once schools were identified, procedures for obtaining the cooperation of the principals were discussed. In some cases, the district made all the arrangements and requested that CSE send all packets of questionnaires to the district for distribution. In the majority of districts, however, it was agreed that CSE would contact the schools directly.

(c) An in-depth interview was conducted to determine the district's policy regarding assessment, mandated testing programs, inservice training for teachers in evaluation procedures, etc. Detailed information regarding the size, structure and demographic characteristics of the district was also recorded.

Although in some instances a single telephone call was sufficient, for most districts it was necessary to place a number of calls to several members of the local staff.

Questionnaires for elementary and secondary teachers and their principals were then sent to the selected districts in the following manner. A package containing the principal's and teachers' questionnaires was sent to the principal of each selected school (or, as noted above, to the district headquarters). The principal, in addition to completing and returning his/her own questionnaire, also distributed the questionnaires to the teachers following the method described in the protocols. Teachers then completed and returned their questionnaires to CSE.

Returns from the first round of the survey were disappointingly low (see Table 7). Problems in printing the questionnaires and in getting approval for some districts to participate in the study, combined with unanticipated delays in the mail and at district offices, meant that a substantial number of the sampled schools received the packets of questionnaire material very close to the end of the school year. In some schools, the questionnaires did not get into the hands of the selected teachers before the beginning of the summer vacation, and other schools indicated to CSE that they would be unwilling to provide the requested information because of the pressure of other work.



                                  Table 7

                               Response Rates

                                   Responses    (Percent    Responses    (Percent
                        Target     received     of          received     of
                        Sample     by July 31   target)     by Nov. 6    target)

Districts
responding               114           53        (46%)          91        (80%)

Principal's
Questionnaires           400          144        (36%)         222        (56%)

Secondary Teachers
Questionnaires           800          244        (31%)         372        (47%)

Elementary Teachers
Questionnaires           800          305        (38%)         488        (61%)



It was felt that these factors accounted for the initially low response rates. However, this same low rate jeopardized the validity of the sampling and hence the credibility of the results, and so it was decided to extend the deadline for the return of questionnaires into the fall. Schools that did not respond in the May/June period were contacted and encouraged to send back completed questionnaires in September. Replacement questionnaires were mailed to those schools which reported having mislaid, or not having received, the original packages. In addition, the opportunity was taken to substitute districts and individual schools that communicated to CSE their decision not to participate in the project. These were replaced with others drawn from the original master list and using the same sampling procedures. Thus, the survey was essentially reactivated at the end of August and during September, and a major effort was made to expand the pool of data that would be available for analysis. Even so, schools proved slow to respond, and questionnaires were still being returned at the end of October.

To obtain the most comprehensive and reliable picture of the national scene with regard to test use, it was considered important to include as many completed questionnaires as possible in the main analysis. For this reason, the deadline for the return of questionnaires to be included in the statistical analysis was delayed until November 6. Table 7 gives the number of completed questionnaires that were received by that date and that constitute the sample for the main analysis. The response rates, even at the cut-off date, were still comparatively low, but since a certain shrinkage had been anticipated in drawing up the sample design, it was considered that the number of returns was sufficient to permit the original analysis to proceed.

The uneven pattern of response, however, casts some doubt on the validity of the achieved sample and certainly ensured that the intended simple weighting design outlined in an earlier section of this report would not be adequate. Unfortunately, the checking of the validity of the sample and the calculation of the weights to be used in the main analysis could not be carried out before the cut-off date for receipt of questionnaires, and so the final analyses have not yet been completed. However, in order to test out the file handling, data editing, and statistical procedures, and to give an indication of the major trends in the data, most of the intended analyses have been run on the subset of responses that were received by July 31. Selected summaries of these are presented in the following pages, but it must be noted that these initial analyses were carried out using unit weights for each district, so they should be regarded as illustrative rather than as representative of the pattern of results to be expected in the full reports.

Analytic framework. It will be recalled from our 1980 report to the NIE that the project's domains of interest (as depicted on page 7 in the present report) and data collected in the project's planning stage generated a series of hypotheses or questions to be explored in the national survey. These areas are summarized here to serve as background for the discussion of survey findings.

Federal/state/local testing requirements influence the distribution and frequency of types of testing at local sites, and thus bear upon patterns of test use. Testing interventions such as minimum competency testing, therefore, may impact on the organization of curriculum and instruction.

The organization of curriculum and instruction constitutes a major influence on the nature of teachers' routine, practical activities and decisions. We hypothesized, therefore, that a greater variety and number of available instructional alternatives in the classroom and school will increase the routine tasks and decisions that require assessment information, and so influence both the patterns of testing that occur locally and the ways test scores are used locally.

The nature of teachers' routine practical activities and decisions is assumed to vary with the types of students enrolled in the school and assigned to a teacher's classroom. Thus, the types of tests given locally and the uses of test results are likely to vary with the demographic and achievement characteristics of students in the school and classroom.






As teachers go about the accomplishment of their practical tasks and decisions, the instances in which they refer to test scores and the ways in which they "weight" test scores are assumed to vary with their perceptions (opinions, values, understandings) of tests and types of tests.

As teachers assess particular tests' strengths and weaknesses and their appropriate uses, they will draw upon their educational and practical experiences with respect to testing. Thus, their training and experience are likely to bear ultimately on their practical decisions about which type of test scores to use and how to use them.

We assume that innovative district and school leadership can provide inservice training experiences that change teachers' perceptions of the utility of particular tests and types of tests, thus influencing teachers' practical test-use decisions. District and school leadership can also act to generate tests, testing programs, and practices that facilitate teachers' accomplishment of their routine tasks under the practical exigencies of their environments.

The types of tests given locally, and the purposes for and frequency with which they are given, will influence local types of test score use. The presence/absence of one type of test may influence the use of scores from another type. For example, the use of minimum competency tests for graduation may encourage teachers to use the results of other kinds of tests to measure students' progress toward the attainment of minimum competencies.






THE ELEMENTARY SCHOOL TEACHER SAMPLE

This section of the report provides some background information on the elementary school teacher sample and some of the characteristics of their classrooms. It also discusses the degree to which teachers make use of various resources, the kinds of assistance on matters of assessment they receive from their districts, school or district level training they receive in testing and assessment, district use of assessment vis-a-vis teachers' instructional practice, and patterns of district reporting of test results back to teachers. This information is offered as a precursor to a subsequent section dealing with teacher attitudes toward testing and their reported uses of assessment results.

Teachers' Professional Background

The first section of the elementary school teachers' questionnaire asked respondents a series of questions about their professional background. The first of these questions dealt with the number of years the teacher had been teaching. Table 8 below illustrates the responses to this question, with years of teaching primarily broken down into five-year periods.

                         Table 8

                    Years of Teaching
                        (N=304)

Number of Years      Number of
Teaching             Respondents      Percentage

 1 -  5                  54               17
 6 - 10                  83               28
11 - 15                  71               23
16 - 20                  42               14
21 - 25                  27                9
26 - 30                  20                6
31 - 40              [illegible]           3






This question was followed by an item asking respondents how many years they had been teaching in their present district. Table 9 below indicates patterns of teacher responses to this item, using the same breakdown of years as followed in the preceding table.

                         Table 9

        Number of Years Teaching in the Present District
                        (N=304)

Number of Years      Number of
in District          Respondents      Percentage

 1 -  5                  98               32
 6 - 10                  72               24
11 - 15                  76               25
16 - 20                  30               10
21 - 25                  12                4
26 - 30                  11                3
31 - 40                   5                2

                        304              100

These data indicate a certain amount of stability among the elementary teacher population. Beyond the 32 percent who have been in their district for one to five years, an additional 24 percent have been in their district for six to ten years and 25 percent for eleven to fifteen years. An additional 10 percent have been in their present district for 16 to 20 years, with the remaining 9 percent serving in the same district in excess of 20 years.

The next questionnaire item asked teachers the highest diploma or degree they have received. Of the 298 respondents, 178 (58%) had received a bachelor's as highest degree, with the remaining 125 (42%) reporting a master's as highest degree; none of the respondents indicated receiving a doctorate.



The year that respondents received their degree is indicated in Table 10 below, which again breaks down year of degree in five-year periods.

                        Table 10

                 Year Degree Was Received
                       (N = 297)

                     Number of
Years                Respondents      Percentage

1935 - 45                 7                3
1946 - 50                 8                3
1951 - 55                12                4
1956 - 60                20                7
1961 - 65                33               11
1966 - 70                49               17
1971 - 75                79               26
1976 - 81                89               30

                        297              100

These data would suggest that the elementary teachers constitute a fairly youthful population, with 56 percent having received their highest degree in the last 10 years, and with almost 75 percent having received their highest degree in the last 15 years.

Two hundred thirty-six teachers indicated that they had received additional credits/units beyond their last degree, with a median value of 23, and 16 teachers reporting 100 or more.

Classroom Characteristics

Of the 305 teachers responding to the questions on the number of grades in their regular classroom, 273 (90%) indicated that they teach only one grade; 22 (7%) that they have two grades; 7 (2%) that they teach three grades; 2 (1%) that they have four grades; and 1 teacher reported having five grades. The following picture, as indicated in Table 11, reflects the "modal" grades taught by the responding sample.






                              Table 11

                        "Modal" Grades Taught
                             (N = 305)

                   Grade  Grade  Grade  Grade  Grade  Grade  Grade
                     1      2      3      4      5      6      7

No. of Teachers      5      5     11    134     43    104      2

Percent              2%     2%     4%    44%    14%    34%     1%



Of the total sample, then, 281 teachers, or 92%, clustered around grades four, five, and six, the targeted grade levels of CSE's national survey.

The teachers were also asked a question on the average numbers of students they presently have in their classrooms. Table 12 below indicates the patterns of average numbers of students.



                        Table 12

         Average Number of Students in a Classroom
                       (N = 302)

Number of          Number of Teachers
Students           With This Size Class      Percentage

(up to) 15             [illegible]                1
16 - 20                [illegible]                6
21 - 25                [illegible]               25
26 - 30                [illegible]               45
31 - 35                [illegible]               19
36 - 40                [illegible]                2
41 plus                [illegible]                2



It would seem that the "average" teacher has a class consisting of 26 to 30 students, and that the great majority of the teachers have classes comprised of 21 to 30 students.

The teachers also indicated on the survey their current teaching responsibilities in reading and math. Of the 301 respondents to this item, 247 teachers (82%) teach both reading and math; 14 teachers (5%) indicated that they teach math only; 36 teachers (12%) that they teach reading only; and 4 teachers (1%) that they presently teach neither. Most teachers devote four to seven hours a week to reading, and four to six hours a week to math. In terms of the different curricular levels at which they must teach in a given classroom, most teachers reported having students representing three to five different reading levels; in math, however, most teachers reported having only one to three different student levels.

Use of Resources 

The next item on the survey asked teachers a variety of questions
dealing with specific resources that they may use in the classroom.
Response rates for these questions ranged from 260 to 290. The following
picture emerged from the teachers' responses about their use of these
resources. In terms of teachers having another adult under their
supervision to help with small group/individual student work, almost 60%
indicated that this resource is not available to them either for reading
or math. Another 10% indicated that such a resource is available, for
both reading and math, but is not used, while 2%, for both reading and
math, indicated that an adult is available but used very infrequently.
A few teachers (1 to 3%) indicated for reading and math that an adult
might be used once or twice a month, while 20 to 25 percent indicated,
again for reading and math, that an adult aide might be used once or
twice a week.

Another item in this series on the survey asked teachers if they
can divide up students for extra help among other teachers. Forty-five
percent of the teachers indicated they do not have this resource
in reading, and approximately 55 percent that they do not have it
for math. Nine percent reported that the resource is available in
reading but not used, and 12 percent that it is available in math but
not used. Again, a few teachers reported using additional teachers a
few times a year for reading and math help; and about 34 percent reported
this practice once or twice a week in reading, and 25 percent once or
twice a week in math.

In terms of the availability of instructional machines, such as
audiovisuals, computer terminals, etc., for students' independent work,
approximately 35 percent of the teachers reported that they are not
available in either reading or math; another 10 percent reported that
such technology, though available, is not used in either reading or
math. The remainder of the teachers reported that they use instructional
technology to varying degrees; for both reading and math, this
use ranges from once a year, to several times a year, to once or twice
a month, to a few times a week. Each of these categories of use
accounts for approximately 5 to 10 percent of the population for both
reading and math.

Similar patterns of teachers' response, for reading and math,
emerged for such resource possibilities as working with other teachers
for planning and developing tests and other evaluation assignments, and
for the availability of specialists to whom students can be sent for
special work; however, in the case of specialists, many more teachers
report frequent use of this resource, for both reading and math, than is
the case in some of the other resources discussed above.

Most of the teachers reported that alternate published or teacher-made
materials are available and quite frequently used for students'
special needs.




In terms of the three remaining resources queried on the survey
(having someone available to read and/or grade tests and other student
assignments; quick, computerized scoring and analysis of tests; and
"item banks" to draw upon in making up teacher tests), a clear picture
emerges: these resources are simply not available to the vast majority
of the respondents in either reading or math (the negative response
rates range from approximately 65 to 75 percent of the respondents).

District Assistance

The next item asked about the district provision of help to teachers
in matters related to student assessment. Approximately 300 teachers
responded to the questions associated with this item.

In terms of receiving help in the administration of required tests,
248 teachers (82%) indicated that such help is available; for 18 percent
of the teachers it is not available. Of the teachers receiving this kind
of help, most indicated that it is relevant or very relevant to their
specific classroom tasks.

Two hundred fifty-six of the teachers (84%) receive assistance in
analysis and explanation of test results; the remaining 16 percent do not.
Of those receiving this help, again most of them noted that it is relevant
to their classroom work.

The picture reflecting teachers receiving assistance in alternative
ways (other than tests) to assess student achievement is quite clear-cut:
50 percent of the sample receive this kind of assistance and the other 50
percent do not. Of those receiving this help, most feel that it is
relevant to their classroom responsibilities.



A similar picture emerged with respect to preparing students
for particular kinds of tests; a little more than 50 percent of the
teachers do not receive this help, and a little less than 50 percent
do. Of those teachers who do, again they find it relevant.

Almost 60 percent of the teachers report receiving assistance in
interpreting and using the results of different kinds of tests; the
remaining 40 percent do not. Again, most teachers who receive this
assistance find it useful.

About half the respondents receive help in ways to tie their teaching
content to that of required tests and half do not. Again, those receiving
this assistance find it useful and relevant to their classroom work.

In terms of help in constructing or selecting good tests, the vast
majority (approximately 85%) do not receive this kind of assistance; of
those who do, most find it relevant or very relevant to their classroom
work. Similarly, most of the teachers (65%) do not receive training
in the use of assessment results to improve the instructional program,
but those who do find it relevant.



District Training/College Courses

Of the 100 teachers responding to the next item on the survey,
approximately 50 percent indicated that in the last two years they have
attended one to five hours of meetings on the topic of selecting or
constructing tests or establishing district testing policies. Another
25 percent noted that they have attended six to ten hours of such
meetings. These data tend to corroborate the item discussed immediately
above. That is, only 100 of our teachers receive district training in
constructing or selecting good tests, and of these, training has amounted
to only three or four hours for 50 percent of the recipients and may
have dealt as much with testing policy as with test selection/construction.
In terms of district inservice on other topics related to student
assessment, most of the 183 respondents to this item indicated such
inservice in the range of one to ten hours in the past two years.

Sixty-six teachers reported taking college courses in the last
two years that were devoted exclusively to student assessment. Of these
teachers, about 30 percent have taken two to five hours; 20 percent six
to ten hours; 10 percent 12 to 15 hours; 6 percent have taken 16 to 20
hours; 14 percent 30 to 35 hours; and 4 percent 35 to 40 hours. The
remaining 15 percent or so reported taking college courses on assessment
in excess of 45 hours.

District Uses of Assessment Information

The next item on the survey asked a series of questions about
school administration uses of assessment information vis-a-vis teachers'
instructional practices. Approximately 300 elementary teachers responded
to this item. In terms of school administration review of test scores
with teachers for the purpose of identifying curriculum content areas
needing extra emphasis, 35 percent of the respondents indicated that
such practice is a regular occurrence and part of the school's routine
procedure, while another 25 percent indicated that it happens quite
frequently but not on a routine or regular basis. For the remainder, it
happens rarely (28%) or not at all (14%).

For almost 30 percent of the teachers, a school administrator
observes the teacher, reviews his/her instructional plans, etc., on a
regular/routine basis to make sure that students' needs as indicated by
test scores are emphasized; for another 25 percent this happens quite
often but is not regular or routine. Of the remainder, about half
reported that such practice happens rarely and half that it does not
happen at all.

In terms of teachers being required to turn in the scores or
grades of tests or assignments that they routinely give, about 15 percent
of the respondents indicated this happens regularly and routinely; for
another 5 percent, this practice goes on quite often but not on a
routine basis. For about 15 percent this happens rarely, and for the
remaining 65 percent it does not happen at all.

The final question in this series asked teachers whether their
school evaluates their teaching on the basis of students' test scores,
and/or establishes test score goals for the students and the teacher
to meet. For the vast majority of the population (70%) this practice
is not followed. Of the remainder, about 17 percent indicated that it
happens rarely; about 6 percent each reported this practice as either
regular and routine, or frequent but neither regular nor routine.

District Reporting of Test Results 

The last of the background questions asked teachers a series of
questions dealing with test turn-around time, usefulness of test
reporting formats, and encouragement to teach in the basic skills.

Of the 300 or so teachers who responded, 133 (44%) indicated that
they receive test results from the district soon enough that they can
use the results for instructional modification; another 139 (46%) noted
that they receive the results too slowly to be of use in modifying
teaching; the remaining 10 percent indicated that the question does not
apply.




In terms of the district reporting test results in a way that
enables the teacher to use them, the vast majority (72%) indicated that
they receive results that are detailed and in a useful format (though
perhaps they arrive too late for this potential to be realized, as was
suggested immediately above). Another 21 percent answered that little
useful information is provided in the way of reported test scores; the
remainder indicated that the question does not apply.

In connection with an assessment program and district encouragement
of teacher emphasis on the teaching of basic skills, 95 percent of the
respondents indicated that their districts do follow this practice.



Teacher Attitude Toward Tests and Test-Related Issues

Approximately 300 elementary school teachers responded to a series
of items probing teacher attitudes toward tests and test-related issues.
Table 13 below illustrates the more prevalent trends emerging from these
items.



                              Table 13

        Teacher Attitude Toward Tests and Test-Related Issues
                             (N = 300)

                                                Percentage of Teachers
Item                                                 in Agreement

Testing motivates my students to study
harder.                                                   60

Commercial tests are usually of high
quality.                                                  60

The content (or skills) on most
required tests is very similar to the
content (or skills) that I teach.                         75

The pressure that testing exerts on the
schools has a generally beneficial
effect.                                                   45

Recently, I have been spending more
teaching time preparing my students to
take required tests.                                      50

The tests developed in our district are
very good.                                                60

The curriculum today demands more
complex student thinking than in the
past.                                                     70

Teachers should not be held accountable
for students' scores on standardized
achievement tests or tests of minimum
competency.                                               70

In our school, students are more rigidly
tracked than they were two or three years
ago.                                                      55

Tests of minimum competency/proficiency/
functional literacy should be required of
all students for promotion at certain
grade levels or for high school graduation.               80

Tests of minimum competency are frequently
unfair to particular students.                            55

As a result of minimum competency tests
(and similar programs) parents are
contacting schools about their children
more frequently or in greater numbers.                    55

Tests of minimum competency have affected
(would affect) the amount of time I can
spend teaching subjects or skills that
the tests do not cover.                                   60

In our school, testing programs are
generally held to be much less important
than the social problems with which we
are concerned.                                            40

Basic skills teaching (including remedial
work) is now consuming a substantially
increased proportion of our school's
educational resources.                                    90

The proportion of our school's resources
now allocated to basic skills teaching is
so great as to detract from the quality
of our total educational program.                         20
From the above patterns, it appears that teachers see some beneficial
effects accruing from testing and test-related matters, and that the tests
they speak about are frequently seen to be of generally high quality and
to match what they teach. Many of the teachers see the importance of
minimum competency tests, although more than half of our respondents
have reservations about the fairness of such tests for certain kinds of
students. Perhaps as a function of minimum competency tests, many
teachers report that their students are more rigidly tracked than was the
case in the recent past, which might concern the majority who believe
that today's curriculum is more complex and demanding upon the student
than was previously the case.

Basic skills teaching appears to consume an increasing proportion of
school resources for the majority of teachers, and affects the amount of
time, for more than half of the sample, that they can devote to other
subjects. More than half the teachers state that their testing programs
are held to be more important than the social problems with which they
are concerned. However, the majority do not believe that the proportion
of school resources given over to basic skills is so high that it detracts
from the quality of the total educational program.

Teacher Uses of Assessment Results

Table 14 following provides a summary of the elementary school
teachers' responses to a series of questions on their use of various
kinds of information for specific decision-making purposes. These
decision areas were concerned with the importance of different kinds of
information for: (1) planning teaching at the beginning of the school
year; (2) for initial grouping or placement of students for instruction;



Table 14

Teacher Use of Assessment Information for Different Decision-Making Purposes
(Percentages reporting use of this information for the specified purpose)

Source/Kind of Information

Previous teacher's comments, reports, grades
Students' standardized test scores
Students' scores on district continuum or minimum competency tests
My previous teaching experience
Results of tests included with curriculum being used
Results of other special placement tests
Results of special tests developed or chosen by my school
Results of tests I make up
My own observations and students' classroom work



Planning Teaching 
at Beginning of 
School Year 



Reading 



61 

57 
53 

96 






Math 
. 55 

. 56 
50 • 



95 



Initial Grouping
of Students

Reading     Math



• 62 

£ ,55 
52 



77 
62 



83 
97 



56- 

53 
50 



69 
6b 



88 

» 

98 
\ 



Changing a Student
from One Group or
Curriculum to
Another

Reading



54 



Math



52 



84 



54 
81 

v 

99 



J* 



85 

N 

VflO 



Deciding on 
Students' Report 
Card Grades 

Reading     Math



19 
22 



. 17 
/ . 

20 



72 



73 



y 

. 38 . 
93 
98 



39 
96 
98 







(3) for making decisions to change a student from one group or curriculum
to another, or to provide remedial or accelerated instruction; and (4) for
making decisions on students' report card grades. The data appearing in
the table indicate the percentage of teachers who rated a given
information source as crucial or important. Response rates ranged from
260 to 300, approximately.

Several conclusions seem to be warranted, at least tentatively, on
the basis of these data. For example, whether a respondent is describing
assessment information use for reading or math, the relative weight
teachers ascribe to a given kind of information remains fairly constant
in the decision-making process.

In terms of decisions about planning for instruction, the individual
teacher's previous classroom experience is by far the single most
important kind of information. Students' scores on standardized and
other formal tests, however, appear to be almost as important in
this decision as comments and other information about students offered
by their previous teachers. This finding confirms conclusions drawn
from previous CSE work on test use. It is interesting to note, however,
that for a sizeable number of teachers, a number that is sometimes in
excess of 50 percent of the sample, students' scores on standardized and
other formal tests are important not only for initial placement decisions
(also found in previous CSE data) but also for decisions about changing
a student from one group to another or one curriculum to another. That
is, for a sizeable number of elementary school teachers, formal test
scores assume importance not only at the beginning of the school year
but also during the school year. This conclusion does not run counter
to previous CSE findings, because such information is used in conjunction
with other kinds of data in the teachers' decision-making, again a
finding supported in previous CSE data. Further, in terms of decisions
about initial placement, by far the most important kind of information is
teacher observations and students' classroom work, followed by the results
of tests teachers have made up themselves and the results of tests that
come with the curriculum they use.

An almost identical pattern appears for decisions about grouping
and/or instructional changes for a student and for decisions about
students' report card grades, with the exception that for these last two
decisions, the weights teachers ascribe to student scores on standardized
and district continuum or competency tests fall off drastically.

As we have reported previously, teachers appear to rely on multiple
sources of information for making their classroom decisions. The
use of "formal" tests is more dominant early in the school year, and,
as the year advances and different kinds of decisions about individual
students, groups, and classes have to be made, teachers seem to switch
more to use of their own professional experiences, observations, students'
classroom work, the results of teacher-made tests, and tests that come
with the curriculum informing their teaching.

One final observation should be made about these data on teacher
use of assessment information. The percentages shown in Table 14 above
reflect numbers of teachers for whom an information source is crucial or
important for a given kind of decision. The percentages not accounted
for in these data constitute numbers of teachers rating a given kind of
information as slightly important or unimportant. In those cases where
percentages of teachers reporting an information source as important lie
in the 50 to 60 percent range, and therefore 40 to 50 percent of the sample
are not accounted for, generally about another 25 percent of the teachers
find the information to be at least slightly important. Exceptions to this
pattern, of course, are students' scores on standardized and district
continuum or competency tests in making decisions about students' grades,
where anywhere from 35 to 50 percent of the teachers find these kinds
of information unimportant.


THE SECONDARY SCHOOL TEACHER SAMPLE 

Before presenting the preliminary findings on secondary teachers'
attitudes toward and uses of assessment information, we will again offer some
relevant background information on the characteristics of this population,
as well as on testing and test-related matters in their schools and districts.

Teachers' Professional Background

Table 15 below presents, for English and mathematics teachers, the number
of years they have been teaching, broken down into five-year segments.



                              Table 15

                         Years of Teaching

                       English                Mathematics
                       (N=124)                  (N=117)

Number of Years    Number of                Number of
   Teaching       Respondents   Percent    Respondents   Percent

    1 - 5              17          13           18          15
    6 - 10             37          30           28          24
   11 - 15             35          27           40          33
   16 - 20             17          15           18          16
   21 - 25             12          10            7           6
   26 - 30              5           5            3           3
   31                   1           1            3           3
                      124         100          117         100



The next item on the survey asked teachers how long they had been teaching
in their districts. Table 16 below shows the response patterns to this item,
again in five-year periods.



                              Table 16

          Number of Years Teaching in the Present District

                       English                Mathematics
                       (N=123)                  (N=117)

   Number of
   Years in        Number of                Number of
   District       Respondents   Percent    Respondents   Percent

    1 - 5              39          32           36          30
    6 - 10             33          27           25          21
   11 - 15             32          27           34          29
   16 - 20             14          11           15          13
   21 - 25              4           3            3           3
   26 - 30              1           1            1           1
   31 - 35              0           0            3           3
                      123         100          117         100



These percentages by years of service in the district are roughly the same
as those found for the elementary school teachers, and indicate a similar
degree of stability among the two samples.

Responses to the question on the highest diploma or degree received by the
secondary teachers were as follows. Of the English teachers, 53 respondents (43%)
list a bachelor's as highest degree received, 67 (54%) a master's, and three
teachers have received a doctorate. The mathematics teachers report that
56 (48%) have a bachelor's as highest degree, 60 (51%) a master's, with one
math teacher having obtained a doctorate.

The year that the English and math teachers received their degrees is
indicated in Table 17 below.



                              Table 17

                     English                  Mathematics
                     (N=124)                    (N=115)

                 Number of                  Number of
   Years        Respondents   Percent      Respondents   Percent

   1940-45            3           3              1           1
   1946-50            1           1              3           3
   1951-55            5           4              1           1
   1956-60            8           7              7           7
   1961-65           14          11             17          15
   1966-70           26          20             25          21
   1971-75           35          29             30          25
   1976-81           32          26             31          28
                    124         100            115         100



These data are again similar to those provided by the elementary school teachers;
the secondary teachers show almost identical patterns of "youthfulness" or
time-in-teaching.

Ninety-six of the English teachers and 91 of the math teachers reported that
they have received additional credits beyond their last degree. Both samples
show a median value of 47, with one teacher in each population reporting 100
or more extra credits or units received.



Classroom Characteristics

Approximately 120 respondents each from the English teachers and the math
teachers answered a series of questions concerning their classroom characteristics.
Among these characteristics were numbers of grades in their class, the grade in
which teachers have the greatest numbers of students, the average number of
students they presently have in their classrooms, their teaching responsibilities
in English and math, numbers of hours of instruction they provide in these
subjects, and the range of curricular levels at which their students are working.

For the English teachers, 30 respondents (24%) indicated that they teach
only one grade; 42 (34%) that they teach two grades; 34 (27%) that they presently
teach three grades; and 18 (15%) that they teach four grades. For the math
teachers, 16 respondents (14%) indicated that they presently teach only one
grade; 27 (23%) that they have two grades; 29 (25%) that they have three grades;
and 41 respondents (35%) that they teach four grades. The "modal" grades
taught by these teachers, expressed as a function of the grades in which they
teach the greatest number of students, appear in Table 18 below.





                              Table 18

                        "Modal" Grades Taught

                     Grade 9         Grade 10        Grade 11        Grade 12
                  English  Math   English  Math   English  Math   English  Math

No. of Teachers      2       0      116     113      4       2       1       0
Percent              2       0       94      97      3       2       1       0



For the total sample of secondary teachers, then, both English and math,
approximately 95% cluster at the tenth grade, the target grade of the national survey.

Table 19 below shows the average numbers of students in the English and math
teachers' classrooms. Approximately 115 teachers responded from each sample.



                              Table 19

              Average Number of Students in a Classroom

                  English Teachers                Mathematics Teachers
Number of      No. of Teachers With               No. of Teachers With
Students         This Size Class     Percent        This Size Class     Percent


10 - 15 


4 * . ' 


.4 / 


8 


8 


16 -' 20 

* 


• * 

15 


13 / 8 

/s 


10 


Q 


C '21 - 25 

* » 


38 

* 


33 "/ 


33 . ' ^ 


.30- 


J - 30 • 


40 


35 / 


41 


37 


31-35 


14 




9 


9 


• 36-40 
41 plus 


1 , 

' ' 2 


* 7 

/ / 


1 

\ I 


.1 













As was the case with the elementary teachers responding to the survey, it appears
that the "average" secondary teacher, whether in English or math, teaches a
class consisting of 26 to 30 students, that the vast majority teach classes
consisting of 21 to 30 students, and that an additional 10 percent or so teach
classes having 31 to 35 students.



In terms of the subjects they teach in their tenth grade classes, 123
English teachers (99%) and 116 math teachers (99%) teach English or math only;
one respondent teaches both English and math in the tenth grade. Most of the
English teachers and most of the math teachers report that students in each
of their classes receive four to six hours of instruction/classwork each
week in the subject area.

In the matter of the range of curricular levels at which they must teach
in their subjects, different patterns appear for the English and math teachers.
For the English teachers, approximately 30 percent teach at only one level and
35 percent have two levels; another 27 percent teach three different levels.
The remaining seven or eight percent have four or five levels. For the math
teachers, approximately 57 percent teach at only one level, 23 percent at two
levels, and 15 percent at three levels; three percent of the math teachers have
four different levels, and two percent reported having more than five curricular
levels.

Use of Resources

The secondary teachers' responses (N = approximately 120 for each sample)
to questions dealing with resource availability and use indicate that certain
kinds of resources are not available to most tenth-grade teachers, whether they
teach English or math. There are also some resources which apparently are
available to most tenth-grade teachers, again regardless of whether they teach
English or math. There are one or two resources which are available to around
half of the tenth-grade teachers regardless of subject taught, and one or two
resources for which patterns of availability appear to differ as a function of
subject taught.

One of the resource options queried on the survey dealt with the availability
of another adult under the teacher's supervision to help with small group or
individual student work. Approximately 80 percent of the English teachers and
85 percent of the math teachers report that this resource is not available; an
additional five percent for each sample report the resource available but not
used. The 15 percent of the English teachers who do have and use this option
vary in degree of use from once or twice a year, once or twice a month, to once or
twice a week. Of the ten percent of the math teachers who do have and use this
option, most frequently cited levels of use are once or twice a year and once or
twice a week.

For about 70 to 75 percent of both the English teachers and the math teachers
responding, the resource option of dividing their students among other teachers
for extra help also appears to be unavailable. In both samples an additional
ten percent report that this option is available but not used. For those few
teachers who report having and using the option, degree of use varies from once
or twice a year, to once or twice a month, to once or twice a week.

Having someone to help the teacher with reading, correcting, or grading
the tests or other assignments they give to students does not appear to be
available to most tenth-grade teachers. Approximately 70 percent of both the
English and math teachers report the option as unavailable, and another five to
ten percent report the option is available but not used. Of the remaining 20
percent or so who do have and use this resource, degree of use varies, but
highest levels of use reported are once or twice a week.

There are one or two resources, on the other hand, which do appear to be
available to most tenth-grade teachers. For instance, approximately 85 percent
each of the English and the math teachers report that alternate published or
teacher-made curriculum materials are available to meet students' special needs.
Almost all of the English teachers use this resource, with almost half of them
reporting weekly use. Of the math teachers, about 12 percent do not use this
option, and the remaining 70 percent or so report most frequently using the
option several times a year or at the weekly level.

About 75 percent of the English teachers and 80 percent of the math teachers
report that they have the option of working with other teachers to plan and
develop tests or other assignments; about 10 percent of each sample report
that they do not make use of this resource. English teachers' most frequently
cited levels of use are once or twice a year and about once a week. Math teachers'
most frequently cited degrees of use are about the same as those of the English
teachers, with the exception that more math than English teachers report use
of this resource on a weekly level.

For about half of the tenth-grade teachers, in both subject areas, quick,
computerized scoring and analysis of tests is reported as being available.
But about 20 percent of the English teachers and 15 percent of the math
teachers report that they do not use this option. Of the teachers who do use
the option, most English and math teachers report degree of use at a few times
a year; about five percent of each sample report using the resource once or
twice a month, and a few teachers report use at once a week.

The numbers of teachers for whom "item banks" of test questions are available
which they can draw upon for making up their own items are roughly similar
to the numbers reporting for quick scoring of tests. This resource is available
for about half of the math teachers and for almost half (46%) of the English
teachers. About seven percent of each sample report they do not make use of
this resource. Degree of use varies across both populations from once or twice
a year, to once or twice a month, to once or twice a week.

The availability of the two remaining resource options queried in the survey
appears to fluctuate somewhat more than those reported above in terms of the
subject taught. For example, while only about 40 percent of the math teachers
report that there are specialists outside their classroom to whom they can send
students for special help, this option was reported as available by about 55
percent of the English teachers. About 10 percent of each population report they
do not make use of this option; for those who do, degree of use varies from
once or twice a year (the most frequent response) to once or twice a month
and once or twice a week.




The availability of instructional machines, such as audiovisual equipment
and computer terminals for students' independent work, also varies by subject
taught. This resource is available for only about half of the math teachers,
while for the English teachers it is reported as available by about 65 percent
of the sample. About 15 percent of the math teachers report they do not use
this option, while less than 10 percent of the English teachers report they do
not use it. For the math teachers, most frequently cited degrees of use are
once or twice a year and once or twice a week; for the English teachers,
the most frequently cited degrees of use are once or twice a year, once or twice
a month, and once or twice a week.


District Assistance 

The next part of the survey asked the secondary teachers a series of questions
about the kinds of assistance their districts provide in matters related to
student assessment. Some clear patterns emerge from the responses of the
approximately 120 English and 120 math teachers who responded.

When we look at the responses of all the secondary teachers responding, in
only one area of district assistance in assessment do the majority of teachers
in both samples indicate receiving help; that help is in the matter of the district
providing analysis and explanation of state, district, or school test results.
For this item, 71 percent of the English teachers and 57 percent of the math
teachers responded that this kind of help is provided. Of the 71 percent of the
English teachers who indicated that they do receive this help, 53 percent noted
that it is relevant or very relevant to their specific classroom tasks; 15 percent
that it is slightly relevant; and only three percent that the help is not relevant.
Of the 57 percent of the math teachers receiving this kind of help, about 40
percent indicated that it is relevant or very relevant to their classroom work;
15 percent that it is at least slightly relevant; and one teacher that the
help is not relevant.

District provision of assistance to teachers also seems to occur for the
administration of tests required by state, district, and/or school, but more so
for English than math teachers. Sixty-three percent of the English teachers
indicated that such help is available to them, but only 42 percent of the math
teachers noted that it is available. Of the English teachers, about 47 percent
responded that the help is relevant or very relevant to their classroom work;
about 13 percent that it is slightly relevant; and two percent that it is not
relevant. Of the 42 percent of the math teachers who do receive this district
help, about 27 percent responded that it is relevant or very relevant; the
remaining 15 percent are almost equally divided between slightly relevant and not
relevant.

The two areas above are the only ones for which districts consistently
make an effort, at least as perceived by the teachers, to provide assistance
in matters related to student assessment. For the remaining six items
querying district assistance the pattern is clear: most teachers, in both
English and math, report that the assistance simply is not available.

For example, when asked if their districts provide help in selecting or
constructing good tests, 80 percent of the English teachers and 85 percent of
the math teachers reported that their districts do not. For those English and
math teachers who do receive this assistance, most reported that it is relevant
or very relevant to them; only about three percent of each sample indicated
that it is only slightly relevant.

In the area of district help in alternate ways (other than tests) that
teachers can use to assess student achievement, 68 percent of the English
teachers and 78 percent of the math teachers reported that it is not available.
Again, of those teachers receiving this help, most indicated that it is relevant
or very relevant, and about six or seven percent that it is only slightly
relevant.

About 65 percent of the English teachers and 70 percent of the math teachers
do not receive district assistance on materials that can be used to prepare
students for particular skills to improve test-taking abilities. But of those
teachers who do receive this help, most find it relevant or very relevant; about
eight to 10 percent of them find it slightly relevant.

Almost 65 percent of the English teachers and about 70 percent of the math
teachers indicated that there is no district assistance in teacher interpretation
and use of different types of tests and their applications. But once again, most
teachers who do receive this assistance note that it is relevant or very relevant
to their classroom work; a few indicate that it is at least slightly relevant.

In the matter of tying what they teach to the kinds of skills or content
covered on required tests, 60 percent of the English teachers and somewhat more
than 70 percent of the math teachers do not receive this kind of help from their
districts. Again, those who do find it mostly relevant or very relevant, with
a few finding it at least slightly relevant.

Finally, 75 percent of the English teachers and almost 85 percent of the
math teachers reported that there is no district training to help teachers use the
results of tests to improve their instructional programs. Of the teachers who
do receive this training, most find it relevant or very relevant to their
classroom work, with a few teachers rating it only as slightly relevant.

With the exception of some district assistance in test administration and
test analysis or interpretation, then, the secondary teachers indicate that most
of them do not receive the kinds of assistance asked about in the survey; on the
other hand, the teachers who do receive assistance in matters related to student
assessment, by and large, appear to find it to have specific relevance
to their classroom work.




District Training/College Courses 

Only 58 of the English teachers and 46 of the math teachers indicated
that they have attended district meetings on test selection/construction
and/or district testing policies in the past two years. Based on the
smaller numbers of teachers responding in the affirmative to a related item
in the preceding series, we might suspect that these meetings were more
concerned with policy than with test selection/construction. At any rate,
of the English teachers attending such meetings, about 40 percent of them
indicated that they have attended one to five hours of such meetings; about
another 35 percent that they have attended six to 10 hours of these meetings.
Of the math teachers responding in the affirmative, about 50 percent have
attended one to five hours of such meetings; about another 25 percent have
attended six to ten hours of such meetings.

In terms of district inservice on other topics related to student
assessment, 63 English teachers and 45 math teachers responded that they
have received such inservice. Of the English teachers, a little more than
65 percent of them indicated such inservice in the range of one to five
hours in the past two years. Of the math teachers, about 65 percent of them
noted that their inservice in the last two years amounts to one to five hours;
for another 30 percent this inservice amounts to six to 10 hours.

Twenty-eight English teachers and 10 math teachers reported that they
have taken college courses in the last two years that were devoted exclusively
to student assessment. For 54 percent of the English teachers, the
courses they have taken amount to one to five hours; one or two teachers
indicate college courses in each of the five-hour intervals between six and
60. Similar patterns hold for the math teachers responding to this item.




District Uses of Assessment Information 

The secondary teachers in the sample responded next to four questions
dealing with school-administration uses of assessment information in
relation to teachers' instructional practices. Again, a little more than
120 English teachers and just under 120 math teachers responded to these
questions.

For the first question, dealing with school administration review
of test scores with teachers to identify skills or content areas in need
of additional emphasis, only about eight percent of the English teachers
and 10 percent of the math teachers indicated that this kind of practice
happens regularly as part of the school's routine procedures. For about
25 percent of the English teachers but only for about 10 percent of the
math teachers this practice happens quite often but not on a regular or
routine basis. For a little more than 40 percent of the English teachers
and just under 40 percent of the math teachers this happens rarely and on
no regular basis. Finally, for about 25 percent of the English teachers
and about 38 percent of the math teachers it does not happen at all.

The next question dealt with school administration observation of
teaching, reviewing teachers' lesson plans, and/or requiring teachers to
write reports to ensure that students' special needs, as shown by test
scores, are emphasized. For about 25 percent of the English teachers
and about 18 percent of the math teachers this practice is regular and
part of the school's routine procedures. For approximately 15 percent of
both teacher samples the practice happens quite often but not on a regular
or routine basis. For just over 30 percent, again for both teacher samples,
the practice is rare and on no regular basis. The practice does not
happen at all for 30 percent of the English teachers and about 37 percent
of the math teachers.

Being required to turn in the scores/grades of tests or assignments
that they routinely give in class appears to be a regular and routine
procedure for about seven percent of the English teachers and five percent
of the math teachers; these approximate percentages hold for quite frequent
occurrence which is neither routine nor regular, and for rare occurrence
on no regular basis. In each case, the percentages are slightly higher
for the English teachers. The practice does not happen at all for about
73 percent of the English teachers and almost 80 percent of the math
teachers.

The last question in this series asked the teachers if their school
administration evaluates their teaching on the basis of student test scores
and/or establishes specific test-score grades for the students and the teacher
to meet. This practice is regular and routine for only one percent of the
English teachers and about three percent of the math teachers. It happens
quite often but not on a regular or routine basis for about seven percent of
the English teachers and three percent of the math teachers. It happens rarely
and on no regular basis for seven or eight percent of each teacher sample, and
does not happen at all for the approximately 85 percent remaining for each sample.

District Reporting of Test Results 

The final background questions in the survey asked about test
turn-around time, the usefulness of test reporting formats, and whether
the districts encourage teachers to emphasize the basic skills. Response
rates were again around 120 for each teacher sample.

Approximately 32 English teachers (26%) and 28 math teachers (24%)
indicated that their district returns test results quickly enough that
teachers can use them for modifying their instruction. About 43 English
teachers (36%) and 32 math teachers (28%) noted that the results are
returned too slowly for the teacher to use them in modifying teaching.
Ten or 11 teachers (9%) in each sample responded that the district does
not return their students' test results, and about 35 English teachers
(30%) and 45 math teachers (39%) indicated that the question does not
apply.
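
The counts and percentages reported for the math teachers on this item can be cross-checked with a little arithmetic. The sketch below is an illustration only, not part of the survey analysis: it assumes the four response categories cover every math respondent, so that the total is simply the sum of the reported counts, and it takes the lower figure of the reported "ten or 11" for the third category. The function name and dictionary labels are hypothetical.

```python
# Cross-check of the math teachers' reported counts against the reported
# percentages for the test turn-around item.
# Assumption: the four categories cover every respondent, so the total is
# the sum of the counts; the reported "ten or 11" is taken as 10.

def to_percent(count: int, total: int) -> int:
    """Express count as a whole-number percentage of total."""
    return round(100 * count / total)

math_counts = {
    "returned quickly enough": 28,
    "returned too slowly": 32,
    "not returned": 10,
    "does not apply": 45,
}

math_total = sum(math_counts.values())
math_percents = {k: to_percent(v, math_total) for k, v in math_counts.items()}

for category, pct in math_percents.items():
    print(f"{category}: {pct}%")
```

Under these assumptions the total comes to 115 respondents, and the rounded shares are 24, 28, 9, and 39 percent, which agrees with the percentages given in the text for the math sample.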

In response to whether the district reports back students' test
results in a way that facilitates teachers' use of the information, 56
English teachers (46%) and 40 math teachers (35%) indicated that detailed
results are provided in a useful format. This finding appears to be a
little at odds with some of the responses to the items immediately above.
It may be that while some teachers receive results too late to modify
instruction, they do make other uses of the information. About 31 English
teachers (25%) and 28 math teachers (24%) responded that the district provides
little useful information in the way of test results, and 35 English teachers
(29%) and 47 math teachers (41%) that the question does not apply.

The last question in this series asked whether the district has encouraged
teachers, in connection with an assessment program, to emphasize the teaching
of basic skills. About 107 English teachers (88%) indicated that their
districts do follow this practice, while the remaining 12% that their districts
do not. For the math teachers approximately the same percentages hold.

Teacher Attitude Toward Testing and Test-Related Issues 

A number of items on the survey probed teachers' attitudes toward testing
and test-related matters. English teachers' response rates to these items
ran from 103 to 122; for the math teachers, response rates were from 97 to 115.
Table 20 below shows the percentages of English teachers and math teachers who
strongly agreed or agreed with a series of statements on the topic of concern.



Table 20:

Teacher Attitude Toward Testing and Test-Related Issues

                                                  Percentage of Teachers
                                                       in Agreement
Item                                              English     Math

Testing motivates my students to study harder.      82         92

Commercial tests are usually of high quality.       47         43

The content (or skills) on most required tests      72         76
  is very similar to the content (or skills)
  that I teach.

The pressure that testing exerts on the schools     59         72
  has a generally beneficial effect.

Recently, I have been spending more teaching        44         34
  time preparing my students to take required
  tests.

The tests developed in our district are very        59         57
  good.

The curriculum today demands more complex           66         51
  student thinking than in the past.

Teachers should not be held accountable for         60         67
  students' scores on standardized achievement
  tests or tests of minimum competency.

In our school, students are more rigidly            40         31
  tracked than they were two or three years ago.

Tests of minimum competency/proficiency/            90         92
  functional literacy should be required of all
  students for promotion at certain grade levels
  or for high school graduation.

Tests of minimum competency are frequently          49         30
  unfair to particular students.

As a result of minimum competency tests (and        39         40
  similar programs), parents are contacting
  schools about their children more frequently
  or in greater numbers.




Table 20:
(continued)

                                                  Percentage of Teachers
                                                       in Agreement
Item                                              English     Math

Tests of minimum competency have affected           65         42
  (would affect) the amount of time I can spend
  teaching subjects or skills that the tests
  do not cover.

In our school, testing programs are generally       33         44
  held to be much less important than the social
  problems with which we are concerned.

Basic skills teaching (including remedial           85         78
  work) is now consuming a substantially
  increased proportion of our school's
  educational resources.

The proportion of school's resources now            33         --
  allocated to basic skills teaching is so great
  as to detract from the quality of our total
  educational program.

-- Entry not legible in the source copy.



Some fairly clear trends emerge from these data. On the one hand,
the vast majority of secondary teachers from both samples state agreement
with the use of minimum competency tests for promotion or graduation.
On the other hand, while most math teachers do not believe that these
tests are unfair to certain kinds of students, about 30 percent of them
do, and about 50 percent of the English teachers would agree.

The great majority of both samples agree that testing motivates
their students to study harder, yet about 60 percent of the English
teachers and 70 percent of the math teachers feel that teachers should
not be held accountable for students' scores on standardized or minimum
competency tests. On the other hand, sizable numbers of teachers in
both samples disagree, and believe that teachers should be held
accountable for student performance on these tests.

At the crux of this issue, perhaps, is the kind of test used, its
purpose, and its origin. For example, the majority of both samples
appear dubious about the quality of commercial tests; greater numbers of
teachers in both samples appear rather more comfortable with the tests
developed in their own districts. Perhaps teachers accept being held
accountable for students' scores on locally developed and locally
"normed" tests. This supposition might be borne out by the high levels
of teacher agreement that the content or skills on most required tests
is similar to the content they teach, especially should these required
tests be locally developed and driven by the local curriculum or come
with the curriculum accepted and used by the teachers.

The great majority of both samples agree that basic skills teaching
is now consuming an increasing proportion of their schools' educational
resources, yet do not appear to believe that this allocation is so great
as to detract from the quality of their schools' total education program.
On the other hand, while teachers seem to support the need to teach in
the basic skills, some of them are more reserved about the curricular
effect of minimum competency testing. For example, most English teachers
agree that tests of minimum competency affect the amount of time they can
spend teaching content/skills that these tests do not cover; this may
suggest overemphasis of reading comprehension testing to the detriment of
other skills held important by English teachers. On the other hand,
almost 60 percent of the math teachers do not agree that tests of minimum
competency affect the amount of time they can spend teaching content/skills
not covered by the tests. Perhaps math teachers take a different view of
subject breadth versus basic skills; perhaps they find a better fit
between required tests and content taught.

A somewhat varied picture of tests and testing seems to exist,
starting with the majority agreement that testing programs are generally
held to be more important than the social programs teachers are concerned
with. Yet the majority of both samples agree that testing has a generally
beneficial effect; they also agree that their schools are not spending
increasing instructional time to prepare students to take required tests,
and that their students are not becoming more rigidly tracked. Given
the finding that most secondary teachers believe that the curriculum today
demands more complex student thinking than in the past, teacher perception
about tracking is important. Rigid tracking, especially if done on the
basis of tests not seen as accurate by teachers, might be seen as affecting
their potential to stimulate students.

Teacher Uses of Assessment Results

The secondary teachers responded to a set of questions on how they
use various kinds of information in their decision making about students.
The decision concerns they responded to were the same as those queried in
the elementary sample: (1) planning teaching at the start of the school
year; (2) initial grouping or placement of students; (3) changing a student
from one group or curriculum to another; and (4) assigning students' report
card grades. The data in the table following indicate the percentage of
teachers who rated a given information source as crucial or important for
the decision purpose. Numbers in parentheses reflect percentages of
teachers reporting that the assessment information is not available.
Response rates continue to be in the range of 115 to 120 for each sample.




Table 21:

Teacher Use of Assessment Information for Different Decision-Making Purposes
(Percentages reporting use of this information for the specified purpose)

                                      Planning teaching   Initial grouping    Changing a student  Deciding on
                                      at beginning of     of students         from one group      students' report
                                      school year                             or curriculum       card grades
                                                                              to another
Source/kind of information            English   Math      English   Math      English   Math      English   Math

Previous teacher's comments,          32 (8)    29        38        38
  reports, grades

Students' standardized test scores    51 (2)    25 (3)    †         †         †         †         †         †

Students' scores on district          50 (19)   27 (18)   †         †         †         †         †         †
  continuum or minimum competency
  tests

My previous teaching experience       98        96

Results of tests included with                            41 (29)   34 (36)   58 (13)   40 (25)   45 (17)   30 (26)
  curriculum being used

Results of other special placement                        42 (26)   28 (34)
  tests

Results of special tests developed                                            53 (23)   32 (39)   32 (25)   26 (33)
  or chosen by my school

Results of tests I make up                                87        72 (9)    92        92 (3)    100       98

My own observations and students'                         97        85        99        96        99        97
  classroom work

*These ratings are for "important" only; they do not reflect any "crucial" ratings.

† Column positions for these entries could not be recovered from the source
copy. In source order, the remaining entries read: for standardized test
scores, 53 (13), 46 (20), 29, 65*, 40 (parenthetical figure illegible),
15* (14), 7* (22); for district continuum or minimum competency tests,
36 (23), 52 (19), 40 (23), 9 (27), 3* (27).



As was the case with the elementary school teachers, individual
teachers' previous experience is by far the most important source of
information for most teachers as they plan instruction at the beginning
of the school year. For the English teachers, students' scores on
standardized tests and their scores on district continua or tests of minimum
competency are held as important by about half of the sample, followed
by previous teachers' comments with about 30 percent. In addition, for
teachers' comments and standardized and district continua/minimum
competency tests, another 20 to 30 percent of the English teachers find them
to be slightly important in this decision area. Note that for students'
scores on district continua/minimum competency tests, almost 20 percent of
the English teachers report this kind of assessment information is not
available to them.

These patterns are of the same order as those obtaining for the
math teachers, with the exception that only about 25 percent of these
raters find standardized and district continua/minimum competency test
scores to be crucial or important. For teacher comments, another 40
percent of the math teachers find them to be slightly important. Again,
a sizable number of math teachers (18%) indicate that district continua/
minimum competency test data are not available to them.

In making their decisions about initial grouping or placement of
students, teachers' own observations and the results of tests they make
up themselves are deemed most important by most of the English and math
teachers. Previous teachers' comments are the same for both populations,
with almost 40 percent finding them crucial or important, and another
25 percent finding them to be slightly important.

Again, as was the case with the elementary teachers, note that
students' scores on "formal" tests continue to have importance for some
teachers as they make their initial grouping decisions; this trend is
somewhat more pronounced for the English teachers, especially in the
case of standardized test scores. These more formal measures, further,
are slightly important for anywhere from 35 to 30 percent of the
teachers depending on the particular source of information. Note once
again that for a sizable number of teachers, certain kinds of test
information are reported as not being available: 10 percent of the math
teachers make this statement for standardized tests; about 20 percent
each for English and math report there are no scores on district continua/
minimum competency tests; depending on the particular measure being cited,
anywhere from 25 to 35 percent of the teachers state there is no
information available from tests that are part of their curricula or from other
special placement tests. While non-availability of some of these measures
(e.g., standardized tests, curriculum tests) is not too surprising early
in the year when initial grouping decisions are being made, the
unavailability of other special placement tests for a fair number of teachers
may be noteworthy.

The picture with regard to teachers' decisions about changing a student
from one group or curriculum to another looks quite balanced. Once again,
teacher observations and results of their own tests are the most important
sources of information for most teachers. But note that both samples
demonstrate that there is still some reported importance for standardized
tests in this decision area. Particularly for the English teachers,
standardized tests, albeit in conjunction with other kinds of assessment
information, are still important in decisions being made once the school
year is well underway. Similar patterns hold for district continua/minimum
competency tests, tests that are part of curricular materials, and
results of special tests developed or chosen by the school. And again,
a fair number of teachers also report these devices to be slightly
important in their decision making.

While some of the findings reflecting the unavailability of certain
kinds of assessment information early in the school year are not
surprising, it is a little more surprising that so many teachers report their
non-availability once the school year is underway and decisions about
instructional and classroom management modifications are being made: in
this regard, about 10 percent of the math teachers report that no
standardized test data are available; roughly 20 percent of each sample
report that information from district continua or minimum competency
tests is not available to them; almost 15 percent of the English teachers
and 25 percent of the math teachers report non-availability of information
from curriculum tests; almost one quarter of the English teachers and
about 40 percent of the math teachers report the same for special tests
developed or chosen by the school.

With regard to making decisions about students' report card grades,
results of their own tests and other observations of students remain of
greatest importance for most teachers. Results of curriculum tests
appear next in order of importance as reflected by percentages of teachers,
followed by results of tests developed or chosen by their school.

Note that the indices of nonavailability of information from a given
measure remain fairly constant between decisions involving student changes
and decisions about their report card grades. That is, where information
is reported as unavailable for teacher decisions during the school year
or semester, it also appears to be equally unavailable at or near the end
of the year/semester. Perhaps for some teachers these measures simply
do not exist; for others it may be that the results of certain measures
are not made available to teachers when they are needed for a given
decision; perhaps some tests are administered and their results filed
centrally and never provided to teachers. The latter two cases
might be distinct possibilities based on teachers' responses earlier in
this section on the manner in which test results are returned to them.

REFERENCES

Airasian, P.W. The effects of standardized testing and test information
on teachers' perceptions and practices. Paper presented at
the annual meeting of the American Educational Research Association,
San Francisco, California, 1979.

Angel, J.L. National, state, and other external testing programs.
Review of Educational Research, 1968, 38(1), 85-91.

Baker, E.L. Achievement testing in urban schools: New numbers.
Paper presented at CEMREL Conference on Urban Education, 1978.
(Also CEMREL Monograph on Urban Education, St. Louis, Missouri:
CEMREL, 1980.)

Bank, A., & Williams, R. Evaluation in school districts: Organizational
perspectives. CSE Monograph Number 10. Los Angeles:
Center for the Study of Evaluation, 1981, in press.

Borko, H. An examination of some factors contributing to teachers'
preinstructional classroom organization and management decisions.
Paper presented at the annual meeting of the American Educational
Research Association, Toronto, Canada, 1978.

Boyd, J., McKenna, B.H., Stake, R.E., & Yashinski, J. A study of
testing practices in the Royal Oak (Michigan) public schools.
Royal Oak, Michigan: Royal Oak City School District, 1975. (ERIC
Document Reproduction Service No. ED 117 161.)

Carducci-Bolchazy, M. A survey of the use of readiness tests. Reading
Horizons, 1968, 18(3), 209-212.

Crew, J.L., & White, E.N. Criterion referenced testing: Usages in
some member systems of the Council of Great City Schools. Journal
of Negro Education, 1978, 47(2), 159-167.

Ebel, R.L. Improving the competence of teachers in educational
measurement. In J. Flynn & H. Garber (Eds.), Assessing behavior:
Readings in educational and psychological measurement. Reading,
Massachusetts: Addison-Wesley, 1967.

Gorth, W.P., & Perkins, M.R. A study of minimum competency testing
programs: Final report. Washington, D.C.: Office of Testing,
Assessment, and Evaluation, National Institute of Education, 1980.



i 132 



Goslin, D.A. The use of standardized ability tests in American secondary
schools and their impact on students, teachers, and administrators.
New York: Russell Sage Foundation, 1965.

Goslin, D.A., Epstein, R., & Hallock, B.A. The use of standardized
tests in elementary schools. Second Technical Report. New York:
Russell Sage Foundation, 1965.

Goslin, D.A. Teachers and testing. New York: Russell Sage Foundation,
1967.



Hotvedt, M.O. Teacher use of standardized tests. Health Service
Center at Houston, University of Texas, Houston, Texas, 1978.

Infantino, R. Results of the NYSEC survey on testing. The English
Record, 1975, 26(2), 4-10.



Jessen, R.J. Probability sampling with marginal constraints. Journal
of the American Statistical Association, 1970, 65(330), 776-796.



Kaufman, J.D. State assessment programs: Current status and a look
ahead. Denver, Colorado: Scholastic Testing Service, Inc., 1979.



Kirkland, M. The effects of tests on students and schools. Review
of Educational Research, 1971, 41(4), 303-350.



Kitsuse, J.I., & Cicourel, A.V. The educational decisionmakers.
Indianapolis, Indiana: Bobbs-Merrill, 1963.



Leiter, K.C.W. Ad hocing in the schools: A study of placement practices
in two kindergartens. In A.V. Cicourel (Ed.), Language use
and school performance. New York: Academic Press, 1974.



Lortie, D.C. Schoolteacher: A sociological study. Chicago: The
University of Chicago Press, 1975.



Lyon, C.D., Doscher, L., McGranahan, P., & Williams, R. Evaluation
and school districts. Los Angeles: Center for the Study of
Evaluation, 1978.






Mehan, H. Ethnomethodology and education. Proceedings of the Second
Annual Conference of the Sociology of Education Association, 1974.



National Evaluation Systems. Proposal submitted to the National
Institute of Education for the review and evaluation of minimum
competency testing programs, 1978.

Resnick, L., & Resnick, D. The social functions of educational testing.
A proposal submitted to the Carnegie Corporation of New York, 1978.

Rudman, H.C. The superintendent and testing: Implications for the
curriculum. Measurement in Education, 1975, 8(4), 1-8.



Rudman, H.C., Kelly, J.L., Wanous, D.S., Mehrens, W.A., Clark, C.H.,
& Porter, A.C. Integrating testing with instruction. A review
1922-1980. East Lansing, Michigan: Institute for Research on
Teaching, 1980.


Salmon-Cox, L. Teachers and tests: What's really happening? Paper
presented at the annual meeting of the American Educational Research
Association, Boston, Massachusetts, 1980. (Also Phi Delta Kappan,
1981, 631-634.)



Schutz, A. Collected papers I: The problem of social reality.
M. Natanson (Ed.). The Hague, Netherlands: Martinus Nijhoff, 1962.



Shavelson, R.J., Russo, N.A., & Borko, H. Experiments on some factors
contributing to teachers' pedagogical decisions. Cambridge Journal
of Education, 1977, 7, 51-70.



Sproull, L., & Zubrow, D. Test information in the suburban school
district. Paper presented at the annual meeting of the American
Educational Research Association, Boston, Massachusetts, 1980.



Stetz, F., & Beck, M. Teachers' opinions of standardized test use and
usefulness. Paper presented at the annual meeting of the American
Educational Research Association, San Francisco, California, 1979.



Yeh, J. Test use in schools. Los Angeles: Center for the Study of
Evaluation, University of California, Los Angeles. Work Unit 4:
Studies in Measurement and Methodology, June, 1978.






